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UBXOOXTXB-XiYTXC FBPTZOS F0SX0N GOT CONSTRUCTS, PROTBXH PRODUCTS 

cmviwa ctkrkfrom, ahd hbtbods of maxxbo and usxhq same 

5 CKOSS RDBnCl TO MUBD APPLICATIONS 

This application is a continuation-in-part of Application No. 
08/221.730. filed April 20. 1994 in the names of Jesse M. Jaynes 
and Gordon R. Julian, which in turn is a continuation of 

10 Application No. 08/225,476. filed April 8, 1994 in the names of 
Jesse H. Jaynes and Gordon R. Julian, which is in turn a 
continuation of Application Ho. 08/148,491, filed November 8, 1993 
and Application No. 08/148,889, filed November 8, 1993, both filed 
in the name of Gordon R. Julian, which are in turn continuations 

15 of Application No. 08/039,620, filed June 4, 1993 in the name of 
Jesse M. Jaynes and Gordon R. Julian. 

BACKGROUND OF TBB ZHVBOTXON 

20 Fiplri r>f fhP Tnvpnfinn 

The present invention relates to ubiquitin-lytic peptide 
fusion gene constructs with enhanced stability and gene 
expression, ubiquitin-lytic peptide fusion protein products, and 
methods of making and using the same. 

25 

n»«i-T--inrinn of Rglaerei Art 

Naturally occurring lytic peptides play an important if not 
critical role as immunological agents in insects and have some, 
albeit secondary, defense functions in a range of other animals. 
30 The function of these peptides is to destroy prokaryotic and other 
non-host cells by disrupting the cell membrane and promoting cell 
lysis. Common features of these naturally occurring lytic 
peptides include an overall basic charge, a small size {23-39 
amino acid residues ) , and the ability to form amphipathic Ot- 

35 helices or S-pleated sheets. Several types of lytic peptides have 
been identified: cecropins (described in U.S. Patents 4,355,104 
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and 4,520,016 to Hultmark ec ai.), defensins, sarcocoxins, 
melittin, and nagainins (described in U.S. Patent No. 4,810,777 to 
Zasloff ) . Bach of these peptide types is distinguished by 
sequence and secondary structure characteristics. 
5 Several hypotheses have been suggested for the mechanism of 

action of the lytic peptides: disruption of the membrane lipid 
bilayer by the amphipathic a-helix portion of the lytic- peptide; 
lytic peptide formation of ion channels, which results in 
osmotically induced cytolysis; lytic peptide promotion of protein 

10 aggregation, which results in ion channel formation; and lytic 

peptide- induced release of phospholipids. Whatever the mechanism 
of lytic peptide-induced membrane damage, an ordered secondary 
conformation such as an amphipathic a-helix and positive charge 
density are features that appear to participate in the function of 

15 the lytic peptides. 

Active synthetic analogs of naturally occurring lytic 
peptides have been produced and tested in vitro against a variety 
of prokaryotic and eukaryotic cell types (see for example 
Arrowood, M.J., et ai.. J. Protozool. 38: 161s [1991]; Jaynes, 

20 J.M., et ai., FASEB J. 2: 2878 [1988]). including: gram positive 
and gram negative bacteria, fungi, yeast, protozoa, envelope 
viruses, virus-infected eulcaryotic cells, and neoplastic or 
transformed mamnalian cells. The results from these studies 
indicate that many of the synthetic lytic peptide analogs have 

25 similar or higher levels of lytic activity for many different 
types of cells, compared to the naturally occurring forms. In 
addition, the peptide concentration required to lyse microbial 
pathogens such as protozoans, yeast, and bacteria does not lyse 
normal mammalian cells. However, because previous work 

30 demonstrates that absolute sequence is not important as long as 
positive charge and amphipathy are preserved, the level of 
activity for a given synthetic peptide is difficult to predict. 

The specificity of the lytic action also depends upon the 
concentration of the peptide and the type of membrane with which 

35 it interacts. Jaynes, J.M. et ai.. Peptide Research 2: 157 (1989) 
discuss the altered cytoskeletal characteristics of transformed or 
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neoplastic mammalian cells chat make them susceptible to lysis by 
the peptides. In these experiments, normal, human non-transformed 
cells re m ai ned unaffected at a given peptide concentration while 
transformed cells were lysed; however, when Tinna.] cells were 
5 treated with the cytoskeletal inhibitors cytochalasin D or 

colchicine, sensitivity to lysis increased. Hie experiments show 
that the action of lytic peptides on normal mammalian cells is 
limited. This resistance to lysis was most probably due to the 
well-developed cytoskeletal network of normal cells. In contrast, 
10 transformed cell lines which have well-known cytoskeletal 

deficiencies were sensitive to lysis. Because of differences in 
cellular sensitivity to lysis, lytic peptide concentration can be 
manipulated to effect lysis of one cell type but not another at 
the same locus. 

15 Synthetic lytic peptide analogs can also act as agents of 

eukaryotic cell proliferation. Peptides that promote lysis of 
transformed cells will, at lower concentrations, promote cell 
proliferation in some cell types. This stimulatory activity is 
thought to depend on the channel-forming capability of the 

20 peptides, which somehow stimulates nutrient uptake, calcium influx 
or metabolite release, thereby stimulating cell proliferation (see 
Jaynes, J.M., Drug News & Perspectives 3: 69 [1990],- and Reed, 
W.A. ec al.. Molecular Reproduction and Development 31: 106 
[1992]). Thus, at a given concentration, these peptides stimulate 

25 or create channels that can be beneficial to the normal mammalian 
cell in a benign environment where it is not important to exclude 
toxic compounds . 

The synthetic lytic peptide analogs typically contain as few 
as 12 and as many as 40 amino acid residues. A phenylalanine 

30 residue is often positioned at the amino terminus of the protein 
to provide an aromatic moiety analogous to the tryptophan residue 
located near the amino terminus of natural cecropins and a uv- 
absorbing moiety with which to monitor the purification of the 
synthetic peptide. The basis for the design of these lytic 

35 peptide analogs is that a peptide of minimal length, having an 
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amphipathic o -helical structural or a &-pleated sheet motif, and 
overall positive charge density effects lytic activity. 

Plant disease is one of the leading causes of crop loss in 
the world and is estimated to cause up to one third of total crop 
5 loss worldwide; for example, in the potato losses associated with 
bacterial disease are as high as 25% of worldwide production. 
Additionally, the cultivation of a few species of plants in a 
concentrated area exacerbates the spread of disease. Recent 
advances in genetic engineering have lead to the development of 

10 plants with disease resistant phenotypes based on the expression 
of recombinant UNA molecules. Transgenic tobacco plants were 
engineered with both a wound inducible Pill promoter and a 
constitutive 35S promoter to express two lytic peptides (SHIVA-1 
and SB-37) with bacteriolytic activity. The SHIVA- 1 plant 

15 demonstrated enhanced resistance to bacterial wilt caused by 

infection by Pseudaman&s soianacearun <Jaynes, J.M. , et ai.. Plant 
Science 89: 43 (1993); Destefano-Beltran, h., et ai.. 
Biotechnology in Plant Disease Control, pp. 175-189, Wiley-Liss 
(1993) . Thus lytic peptides have valuable uses as anti- 

20 phytopathogenic agents. However, chemical synthesis of these 
lytic peptides is very expensive. Therefore, alternate, more 
economical and efficient methods of synthesis are needed, such as 
in vivo synthesis in host cells using recombinant DKA methods. 
Recombinant DKA molecules are produced by sub-cloning genes 

25 into plasmids using a bacterial host intermediate. In principle 
this technique is straightforward. However, any sequence that 
interferes with bacterial growth through replication or production 
of products toxic to the bacteria, such a lytic peptides, are 
difficult to clone. Often, host bacterial cells containing 

30 mutated forms of the DNA sequences encoding toxic products will be 
selected. These mutations can result in either decreased 
expression or production of an inactive product. Bacteria will 
even insert mutations that prevent expression of a potentially 
toxic product in cloned genes controlled by a eukaryotic promoter 

35 that is not active in prokaryotes . The effect of this selection 
of mutated species leads to an inability to isolate sub-clones 
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containing a non-mutated gene of choice. Thus, sane sub-cloned 
genes are unstable in their bacterial hosts, although this 
instability can only be shown empirically. The bacteriolytic 
activity of the lytic peptides is an obstacle to the production of 
5 stable recombinant na molecules that express the genes at high 
levels. 

For example, in an attempt to sub-clone into a standard 
plaamid vector a gene coding for frog magainin, a natural lytic 
peptide, bacterial transformants contained deletion nutations in 

10 the magainin coding region. Another attempt was made to sub-clone 
a synthetic lytic peptide (SEQ ID NO. 98) into a standard plaamid 
vector (pUC19) containing the Cauliflower Mosaic Virus 35S 
promoter. The resulting transformants were screened by polymerase 
chain reaction (ICR) . However, out of 30 colonies, only 2 sub- 

15 clones gave faint positive signals. These two sub-clones were 
sequenced. The sequence showed that one clone had a point 
mutation that introduced a stop codon 3/4 of the way through the 
lytic peptide, and the other clone had a point mutation that 
changed the start codon from methionine to isoleucine. Both 

20 nutations would prevent the biosynthesis of the protein. Four 
more clones were analyzed, and of these four, one was sub-cloned 
in the wrong orientation, and three others had mutations 
introduced into the sequence. One of these sub-clones was 
selected for further analysis, but it inhibited the growth of its 

25 B. coli host. Thus, the production of recombinant Ota molecules 
coding for lytic peptides is difficult due to the uncertainty in 
obtaining the 1 correct sub-clone . 

Ubiquitin is a small, highly conserved protein present in all 
eukaryotes. Ubiquitina are encoded by gene families that are 

30 characterized by two types of basic structures. Polyubiquitin 
genes contain several direct repeats of ubiquitin, and ubiquitin- 
ribosomal fusion genes encode a single ubiquitin unit fused to the 
coding region for a small ribosomal associated protein. Both of 
these gene types are translated as polyproteins and then are 

35 processed by an endogenous ubiquitin hydrolase present in 

eukaryotes to release multiple ubiquitin proteins or ubiquitin and 
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the riboaonal associated protein. A number of ubiquitin cDNAs or 
genomic clones have been isolated, including plant ubiquitin cGNAs 
and genomic clones from the potato (Garbarino, J. and Belknap, w. , 
Plant Molecular Biology 24: 119 (1994); Garbarino. J. ec al., 
5 Plant Molecular Biology 20: 235 (1992)). 

U.S. Patents 5,093,242 and 5.132,213 to Bachmair et ai. teach 
the use of a ubiquitin cloning vector as a method of producing 
specified protein amino- termini . A recombinant SNA molecule was 
constructed with a protein coding gene fused at its amino terminus 

10 to a ubiquitin coding gene. Due to translation as a polypeptide 
and cleavage by hydrolases, a protein with any amino acid at the 
amino terminus can be generated. The amino terminus can be used 
to control the metabolic stability of the protein. However, the 
metabolic stability of the protein is dependent on the resulting 

15 amino acid at the amino -terminus, not the generation of a 
translation polypeptide. 

The forgoing facts suggest that although lytic peptides as a 
class may include species that are efficacious in destroying 
bacteria, neoplastic cells, fungi, virus -infected cells, and 

20 protozoa, this lytic characteristic also decreases the stability 
of sub-cloned lytic peptides in host cells. This decreased 
stability hinders efforts to develop a more economical and 
efficient means of synthesizing lytic peptides. 

It would therefore be a significant advance in the art, and 

25 is correspondingly an object of the present invention to develop a 
method of sub-cloning nucleotide sequences coding for lytic 
peptides into* expression vectors, providing gene constructs with 
enhanced stability and gene expression and reduced toxicity. 

30 summary of tbe ixmam am 

The present invention relates generally to ubiquitin-lytic 
peptide fusion nucleic acid expression vectors comprising a 
promoter and ubiquitin polypeptide coding sequence ligated to a 
35 lytic peptide, ubiquitin-lytic peptide fusion protein products, 
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and methods of making and using the same, as hereinafter more 
fully described. 

It is another object of the invention to provide ubiquitin- 
lytic peptide fusion expression vectors and protein products 
5 derived therefrom. 

It is another object of the invention to provide ubiquitin- 
lytic peptide fusion expression vectors that are expressed in 
plants having utility for promoting wound healing and combatting 
bacterial infections in plants. 
10 It is a further object of this invention to provide 

ubiquitin- lytic peptide fusion polypeptides having utility for 
combatting protozoal infections, neoplasias , fungal infections, 
viral infections, and bacterial infections in mammals and plants. 

It is yet another object of this invention to develop a 
15 method of sub-cloning polypeptide sequences in ubiquitin-fusion 
expression vectors with enhanced stability and gene expression. 

It is yet another object of this invention to provide 
expression vectors containing constitutive and wound inducible 
ubiquitin promoters that are expressed in eukaryotic cells. 
20 It is yet another object of this invention to provide 

expression vectors with prolcaryotic promoters that express 
ubiquitin- lytic peptide fusion genes in prokaryotic hosts, the 
products of which can be cleaved in vitro by ubiquitin hydrolases . 

These and other objects and advantages will be more fully 
25 apparent from the ensuing disclosure and claims. 

BRXHF PEflCRTPTIOB OF THE CMNXBOS 

Figure 1 is a map of a recombinant nucleic acid expression 
30 vector pOCubi3-LP98 containing a 920 bp ubiquitin-ribosomal fusion 
gene promoter region linked to a 228 bp coding region for a 
ubiquitin polypeptide with a six bp B a mH I site at the 3 1 end ( SEQ 
ID NO. 93) that is fused at its 3' end to a gene coding for a 
lytic peptide (D5D*, SEQ ID NO. 98). The Ubi3 ubiquitin-lytic 
35 peptide nucleotide sequence corresponds to SEQ ID NO. 92. A 
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nopaline synthase polyadenylation signal is located at the 3' end 
of the lytic peptide gene. 

Figure 2 is a nap of a recombinant nucleic acid expression 
vector pUCUbi7-LP98 containing a 1220 bp polyubiquitin promoter 
5 region and 568 bp intron linked to a 228 bp coding region for a 
ubiquitin polypeptide with a six bp BaoHX site at the 3' end (SBQ 
ID NO. 96) that is fused at its 3' end to a gene coding- for a 
lytic peptide (D5D». SBQ ID NO. 98) . The Ubi7 ubiquitin-lytic 
peptide nucleotide sequence corresponds to SBQ ID NO. 95. A 
10 nopaline synthase polyadenylation signal is located at the 3' end 
of the lytic peptide gene. 

DBTA3XKD DSSCRIFTXOH 07 IBB INVENTI ON AHD 
FRKFBtRKD * MW " rt ™ffiPfrW TBXRBOF 

15 

The disclosures of prior co-pending U.S. Patent Application 
No. 08/039,620 filed June 4, 1993 in the names of Jesse M. Jaynes 
and Gordon R. Julian, U.S. Patent Application No. 08/148,889 filed 
November 8, 1993 in the name of Gordon R. Julian, U.S. Patent 

20 Application No. 08/148,491 filed November 8, 1993 in the name of 
Gordon R. Julian, U.S. Patent Application NO. 08/225,476 filed 
April 8, 1994 in the names of Jesse M. Jaynes and Gordon R. 
Julian, and U.S. Patent Application No. 08/231.730 filed April 20, 
1994 in the names of Jesse M. Jaynes and Gordon R. Julian, are all 

25 hereby incorporated herein by reference in their entirety. 

The term "amphipathic* as used herein refers to the 
distribution of hydrophobic and hydrophilic amino acid residues 
along opposing faces of an o-helix structure or other secondary 
conformation, which results in one face of the a-helix structure 

30 being predominantly hydrophobic and the other face being 

predominantly hydrophilic. The degree of amphipathy of a peptide 
can be assessed by plotting the sequential amino acid residues on 
an Edmunaon helical wheel (see also Kamtekar, S. ec ai.. Science 
262: 1680 (1993). 

35 The terms 'peptide' and "polypeptide* as used herein refer to 

& molecule composed of a chain of amino acid residues and is 

8 
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intended to be construed as inclusive of polypeptides and peptides 
per se having molecular weights of up to 10.000 daltons, as well 
as proteins having molecular weights of greater that about 10,000 
daltons, wherein the molecular weights are number average 
5 molecular weights. The term is also intended to be construed as 
inclusive of functional equivalents thereof when used in reference 
to a specific peptide coding sequence in the specification and 
claims herein. Functional equivalents of peptides and 
polypeptides include but are not limited to deletions, additions, 

10 and substitutions of amino acids in the polypeptide or peptide 
chain that do not adversely affect the overall function of the 
resulting peptide or polypeptide. 

The term 'plasmid' as used herein refers to a UNA molecule 
that is capable of autonomous replication within a host call, 

15 either extrachromosomally or as part of the host cell 

chromosome (s ) . The starting plasmids herein are conmercially 
available, are publicly available on an unrestricted basis, or can 
be constructed from such available plasmids as disclosed herein 
and/ or in accordance with published procedures. In certain 

20 instances, as will be apparent to the ordinarily skilled artisan, 
other plas m ids known in the art may be used interchangeable with 
plasmids described herein. 

The term 'ligation* as used herein refers to the process of 
forming phosphodiester bonds between two double-stranded DNA 

25 fragments. Unless otherwise specified, ligation is accomplished 
using standard procedures known to one skilled in the art. 

The term 'polymerase chain reaction, * or "PCR" as used herein 
refers to a method for amplification of a desired nucleotide 
sequence in vitro, as described in U.S. Patent No. 4,683,195, 

30 herein incorporated by reference in its entirety. 

The term 'nucleic acid' as used herein refers to 
deoxyribonucleic acid molecules (DNA) composed of a chain of 
deoxyribonucleotides and ribonucleic acid molecules (RNA) composed 
of a chain of ribonucleotides. The term 'nucleic acid' as used 

35 herein is to be construed as including functional equivalents 

thereof when used in reference to a specific nucleotide sequence 

9 
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in the specification and claims herein. Functional equivalents of 
nucleic acid molecules include synonymous coding sequences with 
one or more codon substitutions and deletions or additions that do 
not effect the overall function of the resulting nucleic acid 
5 molecule. The degeneracy of the genetic code is well known to the 
art; therefore, synonymous coding sequences with one or more codon 
substitutions can be readily determined by one of or dinar y skill 
in the art. synonymous nucleotide coding sequences vary from the 
exemplified coding sequences but encode proteins of the same amino 

10 acid sequences as those specifically provided herein or proteins 
with similar function and are therefore also regarded as 
functional equivalents thereof. 

The term "promoter' as used herein refers to an untranslated 
(i.e. one that does not result in a peptide or protein product) 

15 sequence upstream of the polypeptide coding region of a nucleotide 
sequence that controls transcription of a gene. Promoters 
typically fall into two classes, constitutive and inducible. 
Inducible promoters initiate high levels of transcription of the 
nucleic acid under their control in response to external stimuli. 

20 Constitutive promoters maintain a relatively constant level of 

transcription in a given cell. Suitable promoters for use in the 
present may include both prokaryotic and eukaryotic promoters, 
with all ubiquitin promoters being preferred, solanaceous plant 
ubiquitin promoters being highly preferred, and potato ubiquitin 

25 promoters being most preferred. Additional control sequences such 
as ribosomal binding sites and enhancers may be included as 

■ 

control sequences when necessary. 

The term "polyadenylation site* as used herein refers to a 
control sequence located on the 3 ' end of a gene construct that 
30 provides a signal for cleavage and polyadenylation of the 

transcription unit expressed from the promoter. These control 
sequences are known to one skilled in the art 

The term 'expression' as used herein refers to transcription 
and /or translation of a nucleic acid sequence coding for a 
35 protein or peptide. 
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In one embodiment, the present invention is directed to on 
isolated nucleotide sequence comprising a gene coding for a 
ubiquitin polypeptide and functional equivalents thereof, linked 
to a ubiquitin p romo ter and functional equivalents thereof. 
5 Suitable ubiquitin promoters for use in the present invention 
include, but are not limited to, ubiquitin promoters from 
solanaceous plants . Preferably, the ubiquitin p romo ter- is a 
potato plant ubiquitin promoter and most preferably it is the 
potato Ubi3 or Ubi7 promoter. In embodiments wherein the isolated 

10 nucleotide sequence codes for the potato Ubi3 promoter linked to a 
gene coding for a ubiquitin polypeptide it has a nucleotide 
sequence according to SEQ ID NO. 93. The Ubi3 promoter alone also 
has utility as constitutive p r o mo ter in eukaryotes. 

In embodiments wherein the isolated nucleotide sequence codes 

15 for the potato Ubi7 promoter linked to a gene coding for a 

ubiquitin polypeptide it has a nucleotide sequence according to 
SEQ ID NO. 96. The Ubi7 nucleotide sequence according to SEQ ZD 
NO. 96 includes an intron that is part of the ubiquitin 
transcription unit. The intron is not required for gene 

20 expression from the Ubi7 promoter, thus the Ubi7 promoter region 
without the intron can be considered as a specific functional 
equivalent of the Ubi7 promoter. The Ubi7 promoter alone, with 
or without the intron. has utility as a wound inducible promoter 
in eukaryotes. 

25 Preferably, the nucleotide sequence comprising the isolated 

ubiquitin promoter and gene coding for a ubiquitin polypeptide 
further comprises a gene coding for a lytic peptide ligated to the 
3 ' end of the gene coding for a ubiquitin polypeptide . Suitable 
genes coding for a lytic peptide have a nucleotide sequence coding 

30 for any one of the ami no acid sequences according to SBQ ID NO. 1- 
91 and 97-98. 

In one preferred embodiment, the present invention is 
directed to an isolated nucleotide sequence comprising a gene 
coding for a lytic peptide ligated to the 3 ' end of the gene 

35 coding for a ubiquitin polypeptide linked to the Obi3 ubiquitin 
promoter having a nucleotide sequence according to SEQ ID NO. 92. 

11 
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In an alternative of this embodiment, the present invention is 
directed to an isolated nucleotide sequence comprising a gene 
coding for a lytic peptide ligated to the 3* end of the gene 
coding for a ubiquitin polypeptide linked to a Ubi7 ubiquitin 
5 promoter having a nucleotide sequence according to SEQ id NO. 95. 

In another embodiment, the present invention is directed to a 
recombinant nucleic acid expression vector. The vector is 
characterized in that it comprises a nucleotide sequence wherein a 
gene coding for a ubiquitin polypeptide is linked to a ubiquitin 

10 promoter. Preferably, the present invention is directed to a 
recombinant nucleic acid expression vector characterized in that 
it further comprises a nucleotide sequence wherein a gene coding 
for a lytic peptide is ligated to the 3 1 end of the gene coding 
for a ubiquitin polypeptide linked to a ubiquitin promoter. 

15 Suitable vectors for use in this invention include any eukaryotic 
or prokaryotic expression vectors known in the art. Preferable 
vectors for use in this invention are pOC19 and pCGN1547. 

In another embodiment, the present invention is directed to a 
host cell that is transformed by a recombinant DNA expression 

20 vector comprising a gene coding for a ubiquitin polypeptide linked 
to a ubiquitin p ro moter. Suitable host cells for transformation in 
the present invention include all known bacterial host cells, with 
all strains of Escherichia coli and Agrobacterium tumefaciens 
being preferred. Preferably, the present invention is directed 

25 to a host cell the recombinant DNA expression vector further 

comprises a gene coding for a lytic peptide ligated to the 3 ' end 
of the gene coding for a ubiquitin polypeptide linked to a 
ubiquitin promoter. Suitable genes coding for a lytic peptide 
have a nucleotide sequence coding for any one of the amino acid 

30 sequences according to SEQ ID NO. 1-91 and 97-98. 

Preferably, the present invention is directed to a 
solanaceous plant host cell that is transfo r me d by a recombinant 
DNA expression vector. Most preferably the solanaceous plant cell 
is a potato plant host cell. 

35 In another embodiment, the present invention is directed to 

an isolated nucleotide sequence and functional equivalents thereof 

12 
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coding for a lytic peptide, where the nucleotide sequence has a 
sequence coding for any one of the amino acid sequences according 
to SEQ id NO. 1-91 and 97-98. 

In yet another embodiment , the present invention is directed 
5 to a purified ubiquitin polypeptide and functional equivalents 
thereof having an amino acid sequence according to SEQ ID NO. 94. 
This embodiment can further comprise a lytic peptide 
translationally fused to the carboxy terminus of a ubiquitin 
polypeptide . 

10 In another embodiment, the present invention is directed to a 

method of sub-cloning nucleotide sequences coding for lytic 
peptides and expressing such sequences in cells. The method 
comprises a first step wherein a recombinant nucleic acid 
containing a gene coding for a lytic peptide ligated to a gene 

15 coding for a ubiquitin polypeptide linked to a ubiquitin promoter 
is produced in a first host cell. Suitable first host cells 
include any known bacterial host cells. Preferably, the first 
host cell is either an Escherichia coli cell or an Agrobacceri urn 
eumexaciens cell. 

20 If the peptides are sub-cloned using such a ubiquitin- fusion 

expression vector, the following advantage results: the lytic 
peptide gene constructs have increased stability in the bacterial 
host. While not wishing to be bound by any one theory, the 
present inventors believe that the stability is due to the 

25 ubiquitin protein coding nucleic acid region fused to the 5 ' end 
of the lytic peptide nucleic acid sequence. Bacteria do not 
contain the endogenous hydrolase necessary for cleavage of the 
ubiquitin fusion protein, so the gene constructs are not toxic to 
bacteria, since active lytic peptide cannot released. Titus 

30 functional equivalents of the ubiquitin fusion polypeptide include 
any ubiquitin molecule that is capable of deceiving the host cell 
into viewing the gene construct and its products as non-toxic. 

In a variation of this embodiment, the recombinant nucleic 
acid vector is isolated from the first host cell and expressed in 

35 a second host cell. Suitable second host cells are plant and 
animal cells, preferably a solanaceous plant cell, and most 
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preferably a potato plant cell. In the second host cell the 
fusion gene is expressed at high levels and the polyprotein is 
cleaved by endogenous ubiquitin hydrolases to produce active lytic 
peptide. These transgenic hosts provide from the expression 
5 vector lytic peptides in vivo to combat bacterial infections, 
fungal infections, protozoal infections, virus infections, and 
neoplasias . In addition, expression vectors containing- ubiquitin 
promoters that are either constitutive or wound inducible are used 
to express peptides in eukaryotes. 

10 The present invention is also directed to a method of sub- 

cloning nucleotide sequences coding for lytic peptides and 
expressing such sequences in cells. The method comprises producing 
in a host cell a recombinant nucleic acid expression vector 
comprising a gene coding for a lytic peptide ligated to the 3 ' end 

15 of a gene coding for a ubiquitin promoter linked to a prokaryotic 
promoter sequence. Suitable prokaryotic promoters include those 
known to one skilled in the art to be active in prokaryotes and 
used in plasmid vectors for bacterial gene expression. 

The recombinant nucleic acid expression vector is expressed 

20 in the host cell and ubiquitin- lytic peptide fusion polypeptides 
are isolated from the host. Preferably, the host cell is either 
an Escherichia coii cell or an Agrabacterium cumefaciens cell. 
The isolated ubiquitin-lytic peptide fusion polypeptides are then 
cleaved in vitro by ubiquitin hydrolases to release the lytic 

25 peptides from the ubiquitin polypeptide (see U.S. Patent No. 

5,196,321 to Bachmair et al.). The active lytic peptides are then 
used to treab bacterial infections, fungal infections, protozoal 
infections, virus infections, and neoplasias. These isolated 
lytic peptides are in some instances glyoxylated or methylated in 

30 vitro to stabilize against proteolytic digestion in vivo. 

Ubiquitin fusion expression vectors thus have broad utility 
as cloning and expression vectors to stabilize and sub-clone lytic 
peptides nucleotide sequences, as well as a wide variety of 
protein coding nucleic acid sequences that are otherwise toxic to 

35 their hosts. The ubiquitin-lytic peptide expression vectors also 
have broad utility as an economical and efficient means to 
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synthesize lytic peptides in host cells. These lytic peptides 
have utility for combatting protozoal infections, neoplasias, 
fungal infections, viral infections, and bacterial infections in 
mammals and plants. 
5 The features and advantages of the invention are more fully 

shown by the following illustrative examples and embodiments, 
which are not to be limitingly construed as regard the broad 
scope, utility, and applicability of the invention. 

10 Pimmnla 1 

Rmrt»Ht»nr»rivP T.vfic Peptides and TThimiifin nnlvnimUrt* 

Set out in Table 1 below as illustrative examples of lytic 
peptides are the amino acid sequences of families of related lytic 

15 peptides. These lytic peptides are designated for ease of 

reference as SEQ ID NO. 1-91 and 97-98. Nucleic acid sequences 
coding for these lytic peptides and functional equivalents thereof 
represent examples of lytic peptide nucleic acid sequences that 
are sub-cloned to make ubiquitin-lytic peptide fusion gene 

20 constructs and polypeptides. The ubiquitin polypeptide, 

designated for ease of reference as SEQ ID NO. 94, and functional 
equivalents thereof, represents an example of the 5' fusion 
ubiquitin polypeptide. 

25 Mill 1: LYTIC PSPTZd SBQOSHCM 

SEP ID MO. 1 , 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
15 10 15 

30 Lys Ala Val Lys Lys Ala Val Lys Lys Lys Lys 
20 25 

sro td wn. 2 

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Lys Lys Ala 
35 1 5 ' 10 15 
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Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala Val Lys Lys Lys Lys 
20 25 30 

5 Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala val Ala Val Lys 
1 5 10 15 

Ala Val Lys Lys Ala Val Lys Lys Val Lys Lys Ala Val Lys- Lys Ala 

20 25 30 

Val Lys Lys Lys Lys 
10 35 

sro tp no. a 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
1 5 10 15 

15 Lys Ala Val Lys Lys Ala Val 
20 

grin th mo 5 

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Lys Lys Ala 
20 1 5 10 15 

Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala Val 
20 25 

<ren Tti an. 6 

25 Phe Ala Val Ala Val Lys Ala Val Ala val Lys Ala Val Ala Val Lys 
15 10 15 

Ala Val Lys Lys Ala Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala 
20 25 30 

Val 

30 

sen th wo. 7 

Phe Ala Val Gly Leu Arg Ala He Lys Arg Ala Leu Lys Lys Leu Arg 

1 5 10 15 

Arg Gly Val Arg Lys Val Ala Lys Arg Lys Arg 
35 20 25 
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SEQ Tn fl 

Phe Ala Val Gly Leu Arg Ala lie Lys Arg Ala Leu Lys Lys Leu Arg 

15 10 15 

Arg Gly Val Arg Lys Val Ala 
5 20 

SEQ TP NQ. 9 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
15 10 15 

10 Ala Arg Lys lie Ala Arg Leu Gly Val Ala Phe 

20 "25 



SEQ TT? NO ■ 10 

Ala Val Lys Arg Val Gly Arg Arg 
15 1 5 

Ala Arg Leu Gly Val Ala Phe 
20 



Leu Lys Lys Leu Ala Arg Lys lie 
10 15 



20 Phe Ala Val Gly Leu Arg Ala lie Lys Arg Ala Leu Lys Lys Leu Arg 
15 10 15 

Arg Gly Val Arg Lys Val Ala Lys Arg Lys Arg Lys Lys Asp Leu 
20 25 30 



25 SEQ Tn HQ " 

Phe Ala Val Gly Leu Arg Ala lie Lys Arg Ala Leu Lys Lys Leu Arg 

1.5 10 15 

Arg Gly Val Arg Lys Val Ala Lys Asp Leu 
20 25 

30 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 

1*5 10 15 

Ala Arg Lys He Ala Arg Leu Gly Val Ala Phe Lys Asp Leu 
35 20 25 30 
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SFO TP NO. 14 

Ala Val Lys Arg val Gly Arg Arg Leu Lys Lys Leu Ala Arg Lys lie 

1 5 10 15 

Ala Arg Leu Gly Val Ala Phe Lys Asp Leu 
5 20 25 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1 5 10 15 

10 Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
20 - 25 

Sffn ip no. 16 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
15 1 5 10 15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Lys val Ala Val Ala Val 
20 25 30 

SEP Tn 17 

20 Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
15 10 15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Lys 

20 25 30 

Val Ala Val Ala Val 
25 35 

SEQ ™ 18 

Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val Ala Lys Lys Val 
1 5 10 15 

30 Ala Lys Val Ala Val Ala Val 
20 

Spi Tn KO. 19 

Phe val Lys Lys Val Ala Lys Lys val' Lys Lys Val Ala Lys Lys Val 
35 1 " " 5 10 15 

Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Val 
20 25 
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swn Tn wr> sn 

Phe Val Lys Lya Val Ala Lys Lys Val Lya Lys Val Ala Lys Lys Val 

15 10 15 

Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala 
20 25 30 

Val 



10 Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 
1 5*10 15 

Ala Lys Val Ala Lys Lys Val Ala Lys Lys val 
20 25 



15 s*" Tn MTV 22 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 

15 10 15 

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 
20 25 30 



20 



25 



awn Tn ho, 21 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 

1 5 10 15 

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 

20 25 30 

Lys Val Ala Lys Lys 
35 . 



SEP IP NO. 24 

30 Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
15 10 15 

Lys Lys Val Ala Lys Lys Val 
20 
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SEP TP MO, 25 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys val Ala Lys val Ala 

1 * 5 10 15 

Lys Lya Val Ala Lys Lys Val Ala Lys Lys Val Ala 
5 20 25 

SEP. TP TO, 26 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
15 10 15 

10 Lys Lys val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Val Ala Lys 
20 25 30 

Lys 

SEP TP NO. 27 

15 Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
15 10 15 

Lys Lys Val Ala Lys Lys Val Lys Lys Lys Lys 
20 25 



20 TT3 NO. 28 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 

1 5 10 15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Lys Lys 
20 25 30 



25 



30 



ggo Tn no. 29 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 

1*5 10 15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Val Ala Lys 

20 25 30 

Lys Lys Lys Lys LyB 
35 



tt? mp. 3P 

Val Lys Ala Lys Val Lys Ala Lys Val Lys Lys Lys Lys Lys 
5 10 15 



35 Phe Lys 
1 
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ran -m wo- 31 

Phe Lys val Lys Ala Lys Val Lys Ala Lys Val Lya Ala Lys Val Lya 
15 10 15 

5 Ala Lys Lya Lya Lys 
20 

rko m MTV 32 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
10 1 5 10 15 

Ala Lys Val Lys Ala Lys Val Lys Lys Lys Lys 
20 25 

ran TP no. 33 
15 Phe Lys Val Lys Ala Lys val Lys Ala Lys Val Lys 
15 10 

SEP. TD NO. 3A 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
20 1 5 10 15 

Ala 

ran td im. 35 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
25 1 5 10 15 

Ala Lys Val Lys Ala Lys Val 
20 

ran TD NO. 36 

30 Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
i5 10 15 



35 



RFO TT) NO. 37 

Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
15. 10 15 
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Ala Lys Val Lys Ala 
20 



Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 

1 5 10 15 

Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val 



20 



25 



10 TH Mn ^ 

Phe Lys Lys Val Lys Lys Val Ala Lys Lys Val Cys Lys Cys Val Lys 
1 "5 10 1 5 

Lys Ala Val Lys Lys Val Lys Lys Phe 
20 25 



15 



20 



Phe Ala val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 

1 5 10 « 

Lys Ala Val Lys Lys Ala Val Cys Cys Cys Cys 
20 25 



SUP TO NO. 41 

Cvs Cys Cys Cys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
! 5 10 » 

25 Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
20 25 



Phe Ala val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
30 1 5 10 « 

Lys Ala Val Lys Lys Ala Val Ser Ser Ser Ser 
20 25 



cpo TP NO. 43 

35 Ser Ser Ser Ser Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1 5 10 15 
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Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
20 25 

BEO TT1 HtV AA 

Phe Ala Leu Ala Leu Lys Ala Leu Lys Lys Ala Leu Lys Lys Leu Lys 

15 10 15 

Lys Ala Leu Lys Lys Ala Leu 
20 



10 SEP IP m. 41 

Leu Ala Lys Lys Leu Ala Lys Lys Leu Lys Lys Leu Ala Lys Lys Leu 

1 5 10 15 

Ala Lys Leu Ala Leu Ala Phe 
20 



15 



20 



seo r-n nn ar 

Phe Ala Phe Ala Phe Lys Ala Phe Lys Lys Ala Phe Lys Lys Phe Lys 

1 5 10 15 

Lys Ala Phe Lys Lys Ala Phe 
20 



SEO TD NT). 47 

Phe Ala He Ala lie Lys Ala lie Lys Lys Ala He Lys Lys He Lys 
15 10 15 

25 Lys Ala lie Lys Lys Ala lie 
20 

SEQ TD MT> AH 

Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe Ala Lys Lys Phe 
30 1 5 10 15 

Ala Lys Phe Ala Phe Ala Phe 
20 
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gwn tt> wr>. 49 

Phe Lys Arg Leu Ala Lys lie Lya val Leu Arg Leu Ala Lys lie Lys 

1 ' 5 10 15 

Arg 



10 



spn m no 50 

Lys Leu Lys Leu Ala Val Lys Leu Val Gly Leu Leu Arg Lys- Lys Arg 

15 10 15 

Ala Leu Lys lie Ala Leu Arg Gly Val Ala Lys Arg Ala Gly Arg Leu 

20 25 30 

Ala Val Arg Lys Phe 
35 



cpn xn wo qi 

15 Phe Ala Arg Ala Arg Lys Ala Arg Lys Lys Ala Arg Lys Lys Arg Lys 
15 10 15 

Lys Ala Arg Lys Lys Ala Arg Lys Asp Arg 
20 25 



20 ssq Tr > aa 

Phe Ala Val Ala Val Cys Ala Val Cys Cys Ala Val Cys Cys Val Cys 

1,5. 10 15 

Cys Ala Val Cys Cys Ala Val 
20 



25 



30 



SSQ Tn HP- « 

Phe Ala Val Ala Val Ser Ala Val Ser Ser Ala Val Ser Ser Val Ser 

15 10 15 

Ser Ala Val Ser Ser Ala Val 
20 



Phe Ala Val Ala Val Ser Ala Val Ser Ser Ala Val Ser Ser Val Ser 
15 10 15 

35 Ser Ala Val Ser Ser Ala Val Ser Ser Ser Ser 
20 25 
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am tp wo. 55 

Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe Ala Lys Lys Phe 
15 10 15 

5 Ala Lys Phe Ala Phe Ala Phe Lys Lys Lys Lys 
20 25 

gwn tt> Km S6 

Lys Lys Lys Lys Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe 
10 1 5 . 10 15 

Ala Lys Lys Phe Ala Lys Phe Ala Phe Ala Phe 
20 25 



SFO TP WO. 57 

15 Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe Val Arg Lys Phe 
15 10 15 

lie Arg Phe Ala Phe Leu Phe 
20 



20 cpti Tn >3Q SB 

Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe Val Arg Lys Phe 

1 5 10 15 

lie Arg Phe Ala Phe Leu Phe Lys Arg Lys Arg 
20 25 



25 



30 



svr> rr> wn. so 

Lys Arg Lys Arg Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe 

1 5 10 15 

Val Arg Lys Phe He Arg Phe Ala Phe Leu Phe 
20 25 



giro T n wn fiO 

lie Ala Lys Lys He Ala Lys Lys He Lys Lys He Ala Lys Lys He 
1 5 10 15 

35 Ala Lys He Ala He Ala He 
20 
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ggn rr> wn. 61 

He Ala Lys Lys lie Ala Lys Lys 
1 5 
5 Ala Lys He Ala lie Ala He Lys 
20 

Lys Lys Lys Lys He Ala Lys Lys 
10 1 5 

Ala Lys Lys He Ala Lys He Ala 
20 



He Lys Lys He Ala Lys Lys He 

10 15 
Lys Lys Lys 
25 

He Ala Lys Lys He Lys Lys He 

10 15 
He Ala He 
25 



15 He Ala Arg Lys He Leu Lys Arg He Lys Lys He Val Arg Lys Phe 
15 io 15 

He Arg He Ala He Leu He 
20 



20 TH NO. 64 

He Ala Arg Lys He Leu Lys Arg He Lys Lys He Val Arg Lys Phe 

15 10 15 

He Arg He Ala He Leu lie Lys Arg Lys Arg 
20 25 



25 



30 



fWl IB NO. 65 

Lys Arg Lys Arg He Ala Arg Lys He Leu Lys Arg He Lys Lys He 

1 ' 5 10 15 

Val Arg Lys Phe He Arg He Ala He Leu He 
20 25 



35 



TPO TP NO 66 

Phe Lys Leu 

1 
Leu 



Arg Ala Lys He Lys Val Arg Leu Arg Ala Lys He Lys 
5 10 15 
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«ren Tn no. 67 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg 

15 " 15 

Ala Lys lie Lys Leu 
5 20 

cm m wry 68 

Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg Ala Lys He Lys 
X * 5 10 15 

10 Leu Lys Arg Lys Arg_ 
20 



15 



TP NT). 69 

Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg Ala Lys He Lys 

1 5 10 15 

Leu Arg Val Lys Leu Lys He 
20 



<reo TD no. 70 

20 Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg Ala Lys He Lys 

Leu Arg Val Lys Leu Lys He Lys Arg Lys Arg 
20 25 



25 SEP. Tn HO - 71 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg 

1.5 10 I 5 

Ala Lys He Lys Leu Arg Val Lys Leu Lys He 

20 25 



30 



35 



ere Tn no 72 

Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg Ala Lys He Lys 

IS io 15 

Leu Arg Val Lys Leu Lys He Arg Ala Arg He Lys Leu 
20 25 
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ggn -m nn. 73 

Phe Lys Leu Arg Ala Lys lie Lys Val Arg Leu Arg Ala Lys He Lys 

15 10 15 

Leu Arg Val Lya Leu Lys He Arg Ala Arg He Lys Leu Lys Arg Lys 
20 25 30 

Arg 



10 



CTT> TD MO. 7d 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg 

15 10 15 

Ala Lys He Lys Leu Arg Val Lys Leu Lys He Arg Ala Arg He Lys 
20 25 30 

Leu 



15 CTD TT> WD. 75 

Phe Lys Leu Arg Ala Lys life Lys Val Arg Leu Arg Ala Lys He Lys 

15 10 15 

Leu Val Phe Ala He Leu Leu 
20 



20 



25 



ggn TT. NO 7fi 

Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg Ala Lys He Lys 

15 10 " 15 

Leu Val Phe Ala He Leu Leu Lys Arg Lys Arg 
20 25 



SEP TT) NO. 77 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg 
15 10 15 

30 Ala Lys He Lys Leu Val Phe Ala He Leu Leu 
20 25 



35 



flfin TP M. 78 

Val Phe Ala He Leu Leu Phe Lys Leu Arg Ala Lys He Lys Val Arg 
15 10 15 



28 



SUBSTITUTE SHEET (RULE 26) 



WO 96/03519 



PCT/US95/093J9 



Leu Arg Ala Lys He Lys Leu 
20 

STO IP MP. 79 

5 Val Phe Ala He Leu Leu Phe Lya Leu Arg Ala Lys He Lys Val Arg 
15 10 15 

Leu Arg Ala Lys He Lys Leu Lys Arg Lys Arg 
20 25 



10 SFO TP NO- BO 

Lys Arg Lys Arg Val Phe Ala He Leu Leu Phe Lys Leu Arg Ala Lys 

15 10 15 

He Lys Val Arg Leu Arg Ala Lys He Lys Leu 
20 25 



15 



20 



sbt. m nr> m 

Val Gly Glu Cys Val Arg Gly Arg Cys Pro Ser Gly Met Cys Cys Ser 

15 10 15 

Gin Phe Gly Tyr Cys Gly Lys Gly Pro Lys Tyr Cys Gly 
20 25 



Val Gly Glu Cys Val *xg Gly Arg Cys Pro Ser Gly Met Cys Cys Ser 
15 10 15 

25 Gin Phe Gly Tyr Cys Gly Lys Gly Pro Lys "lyr Cys Gly Arg 
20 25 30 

sro -m wn. 

Leu Gly Asp Cys Leu Lys Gly Lys Cys Pro Ser Gly Met Cys Cys Ser 
30 1 5 10 15 

Asn Tyr Gly Phe Cys Gly Arg Gly Pro Arg Phe Cys Gly Lys 
20 25 30 



SPO TH NO. 8* 

35 Gin Cys He Gly Asn Gly Gly Arg Cys Asn Glu Asn Val Gly Pro Pro 
15 10 15 
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Tyr Cys Cys Ser Gly Phe Cys Leu Arg Gin Pro Gly Gin Gly Tyr Gly 

20 25 30 

Tyr Cys Lys Asn Arg 
35 

5 

cto T n >«-> 85 

Cys lie Gly Asn Gly Gly Arg Cys Asn Glu Asn Val Gly Pro- Pro Tyr 

15 10 1* 

Cys Cys Ser Gly Phe Cys Leu Arg Gin Pro Asn Gin Gly Tyr Gly Val 
10 20 25 30 

Cys Arg Asn Arg 
35 

crep m KD 86 

15 Cys lie Gly Gin Gly Gly Lys Cys Gin Asp Gin Leu Gly Pro Pro Phe 
15 10 15 

Cys Cys Ser Gly Tyr Cys Val Lys Asn Pro Gin Asn Gly Phe Gly Leu 

20 25 30 

Cys Lys Gin Lys 
20 35 

■inn tp no. S7 

Gin Lys Leu cys Glu Arg Pro Ser Gly »r Trp Ser Gly Val Cys Gly 
15 10 15 

25 Asn Asn Asn Ala Cys Lys Asn Gin Cys lie Asn Leu Glu Lys Ala Arg 
20 25 30 

His Gly Ser Cys Asn Tyr Val Phe Pro Ala His Lys 
35 40 



30 <rer> to mo. 88 

Gin Arg Val Cys Asp Lys Pro Ser Gly Thr Trp Ser Gly Leu Cys Gly 

1 5 10 15 

Asn Asn Asn Ala Cys Arg Gin Asn Cys He Gin Val Asp Arg Ala Lys 
20 25 30 

35 Lys Gly Ser Cys Gin Phe Leu Tyr Pro Ala Lys Lys 
35 40 
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SEQ ID NO. aa 

Gin Lys Leu Cys Gin Arg Pro Ser Gly Thr Trp Ser Gly Val Cya Gly 
15 10 15 

5 Asn Asn Asn Ala Cys Lys Asn Gin Cys He Arg Leu Glu Lys Ala Arg 
20 25 30 

His Gly Ser Cys 
35 

10 sry*) td no on 

Gin Arg Val Cys Asn Lys Pro Ser Gly Thr Trp Ser Gly Leu Cys Gly 

15 10 15 

Asn Asn Asn Ala Cys Arg Gin Asn Cys He Lys Val Asp Arg Ala Lys 
20 25 30 

15 Lys Gly Ser Cys 
35 

SEQ IP MP. 91 

Met Leu Glu Glu Leu Phe Glu Glu Met Thr Glu Phe He Glu Glu Val 
20 1 5 10 15 

He Glu Thr Met 
20 

SEQ to wn <M 
25 Met Gin He Phe Val Lys Thr Leu 
1 5 
Thr Gly Lys Thr He Thr Leu Glu Val Glu Ser Ser Asp Thr 

10 15 20 

He Asp Asn Val Lys Ala Lys He Gin Asp Lys Glu Gly He 
30 25 30 35 

Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys Gin Leu 

40 45 50 

Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn He Gin Lys Glu 
55 60 
35 Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gly Ser 
65 70 75 
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BFT> TD MTV q? 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lya Leu 
1 5 10 15 

5 Ala Arg Lys lie Ala Arg Leu Gly Val Ala Lys Leu Ala Gly Leu Arg 
20 25 30 

Ala Val Leu Lys Phe 
35 

10 sim th wr> QR 

Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Asp Arg Lys lie 

15 10 15 

Asp Arg Leu Gly Val Asp Phe 
20 

15 

rrm^r^^rm »f nvvi m,i t i n-ivr i r. Penrifle Funinn Pi nmni tin W i th 

nh^miifin-rihoanml Fusion Gene Pmnnr.Br (Pbi3 1 

20 Exemplary and preferred pUC19 and pCGN1547 plasmid vectors 

containing a potato (5olanum tuberosum) ubiquitin-ribosomal fusion 
promoter (Obi3) . a region coding for a ubiquitin polypeptide, and 
a gene coding for a lytic peptide are constructed. 

To obtain the genomic clone containing a ubiquitin-riboaomal 
25 fusion promoter and ubiquitin polypeptide coding region, a Xt 1X11 
potato genomic library is first prescreened using PCR. The PCR 
primers are homologous to regions of the ubiquitin-ribosomal 
fusion cDNA (see Garbarino J., et al. , Plant Molecular Biology 20: 
235(1992); Garbarino J. and Belknap W. , Plant Molecular Biology 
30 24: 119 (1994) ; both of which are hereby incorporated by reference 
herein in their entirety) . A primer 5' to the beginning ATG of 
ubiquitin and a primer complementary to a sequence near the 5' end 
of the ribosomal protein are used. 

' The library is plated in 22 aliquots containing approximately 
35 0.5x10 s pfu (plaque forming units) each on an E. coli lawn. A plug 
is taken from each of the 22 resulting plaques and the eluant from 
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each is subjected to PCR under standard conditions. The PCR 
products are run on agarose gels. The gels are then blotted and 
probed with the ubiquitin coding region of the ubiquitin-ribosomaJ 
fusion cDHA according to standard conditions. Two of the plugs 
5 produce PCR products that hybridize to the cDNA probe. Both of 
these are the correct size for the predicted ubiquitin-ribosomal 
fusion genomic fragment. 

The eluants from these two plugs are then plated and screened 
with the ubiquitin coding region of the ubiquitin-ribosomal fusion 

10 cOTA according to standard conditions. For verification, the 

positive plaques from the initial screen are replated and screened 
with a probe containing both the ribosomal protein-coding region 
and the 3' end of the potato ubiquitin-ribosomal fusion cDNA. 

The genomic clones are sequenced using Sequenase version 2.0 

15 (United States Biochemical Corporation) or Pr omega fuel ENA 
Sequencing System using standard conditions. A genomic clone 
containing both the ubiquitin-riboscBial fusion p rom oter region and 
the ubiquitin-ribosomal fusion coding region is identified. 

A c h i mer ic gene is then constructed with a portion of the 

20 potato ubiquitin-ribosomal fusion genomic clone ligated to a lytic 
peptide gene. PCR is used to generate the Dbi3 promoter and 
ubiquitin portion of the chimeric gene. The 0bi3 promoter region 
includes the 920 bp promoter region upstream of the- ubiquitin ATG, 
and the ubiquitin polypeptide coding region is 228 bp plus 6 bp of 

25 a BamHI restriction site at the 3' end (SEQ ID no. 93) . The 

primers contain BamHI restriction sites and are homologous to the 
5' end of the* Dbi3 promoter and to the 3' end of the ubiquitin 
polypeptide coding region. The ubiquitin-ribosomal fusion genomic 
clone is used as the amplification template. This insert is first 

30 sub-cloned into the plasmid pCOJ1547, as described in Garbarino ec 
ai. , Plant Molecular Biology 24: 119 (1994). The 0bi3 insert is 
then isolated from pCGN1547 using the BamHI sites and ligated into 
pUC19 under standard conditions. Transformation of E. coli is 
done according to standard conditions and correct sub-clones are 

35 confirmed by mini -prep or PCR UNA analysis. This plasmid is 
designated pOCUbi3. 
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A nucleotide fragment coding for the lytic peptide 
< corresponding to the amino acid sequence SEQ TD NO. 98) is 
synthesized using a nucleic acid synthesizer, adding a stop cod on 
to the 3' end, and used as a PCR template. The 5' PCR primer 
5 homologous to the lytic peptide nucleotide sequence contains a 

BamHI site, and the 3' primer contains an Xbal site. These sites . 
are used to sub-clone the PCR generated insert into pOC19 . A 
nopaline synthase polyadenylatian signal (NOS3 ' ) is then cloned 3 ' 
to the lytic peptide sequence. Following sequence analysis, the 
10 BamHI insert containing the Ubi3 promoter and ubiquitin coding 
region (SEQ ID NO. 93) is cloned 5' to the lytic peptide. 

After transforming E. coli under standard conditions, pOC19 
sub-clones are selected for mini -prep or PCR SNA analysis 
according to standard conditions. The direction of the p ro m o ter 
15 is confirmed and the junction sequences are verified by sequencing 
according co standard conditions. The resulting 0bi3 ubiquicin- 
lytic peptide fusion gene construct corresponds to SEQ ID NO. 92. 
Unlike previous cloning attempts using the CaMV3SS promoter, as 
described in the Background section, the sequence does not reveal 
20 any point mutations in the lytic peptide sub-clones. The plasmid 
is stable in the E. coli host and did not inhibit its growth. 

The resulting pUC19 recombinant plasmid is shown in the 
plasmid map in Figure 1. The sequence for the Ubi3 -ubiquitin 
insert containing the ubiquitin-ribosomal fusion gene p romo ter and 
25 the ubiquitin coding region corresponds to SEQ ID NO. 93 in Table 
2 below. The sequence for the chimeric Ubi3 ubiquitin- lytic 
peptide fusion gene construct corresponds to SEQ ID NO. 92 in 
Table 2 below. This plasmid is designated as pDCDbi3-LP98. 
The entire Dbi3 ubiquitin-lytic peptide fusion gene 
30 construct, including the polyadenylation site, was isolated from 
pDC19 as an Asp718/HindIIZ restriction fragment and sub-cloned 
into the pCGOT.547 Agrobacterium vector for use in plant 
transformation (see McBride, et ai.. Plant Molecular Biology 14: 
269 (1990). This plasmid is designated as pCGN0bi3-LP98. 
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TABLK 2: HUCLBOTXDB SBQDBHCB OF POTATO UBIQOXTXN-RIBOSCMAL FDSZCH 
VBCMORR (TJBI3) AHD UBXQDTTXH COOXBS RXOIOH XHSXRT, AKD UBXGOXVZH- 
LTTIC PgP T IOH FUSION OSKB C0SSTKOCT 



5 snr> m wr> <^ 





CCAAAGCACA 


TACTTATCGA 


TTTAAATTIC 


ATCGAAGAGA 


TTAATATCGA 


50 




ATAATCATAT 


ACATACTTTA 


AATACATAAC 


AAATTTTAAA 


TACATATATC 


100 


10 


TGGTATATAA 


TTAATTTTTT 


AAAGTCATGA 


AGTATGTATC 


AAATACACAT 


150 




ATQGAAAAAA 


TTAACTATTC 


ATAATTTAAA 


AAATAGAAAA 


GATACATCTA 


200 


15 


GTGAAATTAG 


GTGCATGTAT 


CAAATACATT 


AGGAAAAGGG 


CATATATCTT 


250 


GATCTAGATA 


ATTAACGATT 


TTGATTTATG 


TATAATTTCC 


AAATGAAGGT 


300 




TTATATCTAC 


TTCAGAAATA 


ACAATATACT 


TTTATCAGAA 


CATTCAACAA 


350 


20 


AGCAACAACC 


AACTAGAGT3 


AAAAATACAC 


ATTGTTCTCT 


AGACATACAA 


400 




AATTGAQAAA 


AGAATCTCAA 


AATTTAGAGA 


AACAAATCTG 


AATTTCTAGA 


450 


25 


AGAAAAAAAT 


AATTATGCAC 


TTTGCTATTG 


CTCGAAAAAT 


AAATGAAAGA 


500 


AATTAGACTT 


TTTTAAAAGA 


TGTTAGACTA 


GATATACTCA 


AAAGCTATTA 


550 




AAGGAGTAAT 


ATTCTTCTTA 


CATTAAGTAT 


TTTAGTTACA 


GTCCTGTAAT 


600 


30 


TAAAGACACA 


TTTTAGATTG 


TATCTAAACT 


TAAATGTATC 


TAGAATACAT 


650 




ATATTTSAAT 


GCATCATATA 


CATGTATCCG 


ACACACCAAT 


TCTCATAAAA 


700 


35 


AACGTAAXAT 


CCTAAACTAA 


TTTATCCTTC 


AAGTCAACTT 


AAGCCCAATA 


750 


TftCATTTTCA 


TCTCTAAAGG 


CCCAAGTGGC 


ACAAAATGTC 


AGGCCCAATT 


800 




ACGAAGAAAA 


GGGCTTGTAA 


AACCCIAATA 


AAGTGGCACT 


GGCAGAGCTT 


850 


40 


ACACTCTCAT 


TCCATCAACA 


AAGAAACCCT 


AAAAGCCGCA 


GCGCCACTGA 


900 




TTiU'lCrCCT 


CCAGGCGAAG 


ATG CAG ATC TTC GTG AAG ACC TTA 


944 



Met Gin lie Phe Val Lys Thr Leu 



ACG GGG AAG ACG ATC ACC CTA GAG GTT GAG TCT TCC GAC ACC 986 
Thr Gly Lys Thr lie Thr Leu Glu Val Glu Ser Ser Asp Thr 
10 15 20 

50 ATC GAC AAT GTC AAA GCC AAG ATC CAG GAC AAG GAA GGG ATT 1028 
lie Asp Asn Val Lys Ala Lys He Gin Asp Lys Glu Gly He 
25 30 35 
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CCC CCA GAC CAS CAS CGT TPS ATT TIC GCC GGA AAG CAG CTT 1070 
Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys Gin Leu 
g 40 45 50 

GAG GAT GGT CGT ACT CTT GCC GAC TAC AAC ATC CAG AAG GAG 1112 
Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn He Gin Lys Glu 
55 60 

10 TCA ACT CTC CAT CTC GTG CTC CGT CTC CGT GGT GGT • 1148 

Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65 70 75 

GGA TCC GCT GTT AAA AGA GTG GGT CGT AGG TTG AAA AAG TTG 1190 
15 Gly Ser Ala val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
80 85 90 

GAC CGT AAG ATT GAT AGG TTA GGA GTT GAT TTT TGATC 1228 
Asp Arg Lys lie Asp Arg Leu Gly val Asp Phe 
20 95 100 

ran rn wn o-> 





CCAAAGCACA 


TACTTATCGA 


TTTAAATTTC 


ATCGAAGAGA 


TTAATATCGA 


50 


25 


ATAATCATAT 


ACATACTTTA 


AATACATAAC 


AAATTTTAAA 


TACATATATC 


100 




TGGTATATAA 


TTAATTTTTT 


AAAGTCATGA 


AGTATGTATC 


AAATACACAT 


150 


30 


ATGGAAAAAA 


TEAACTATTC 


ATAATTTAAA 


AAATAGAAAA 


GATACATCTA 


200 


GTGAAATTAG 


GTGCATGTAT 


CAAATACATT 


AGGAAAAGGG 


CATATATCTT 


250 




GATCTAGATA 


ATTAACGATT 


TTGATTTATG 


TATAATTTCC 


AAATGAAGGT 


300 


35 


TTATATCTAC 


TTCAGAAATA 


ACAATATACT 


TTTATCAGAA 


CATTCAACAA 


350 




AGCAACAACC 


AACTAGAGTG 


AAAAATACAC 


ATTGTTCTCT 


AGACATACAA 


400 


40 


AATTGAGAAA 


AGAATCTCAA 

• 


AATTTAGAGA 


AACAAATCTG 


AATTTCTASA 


450 


AGAAAAAAAT 


AATTATGCAC 


TTTGCTATTG 


CTCGAAAAAT 


AAATGAAAGA 


500 




AATTAGACTT 


TTTTAAAAGA 


TGTTAGACTA 


GATATACTCA 


AAAGCTATTA 


550 


45 


AAGGAGTAAT 


ATTCTTCTTA 


CATTAAGTAT 


TTTAGTTACA 


GTCCTGTAAT 


600 




TAAAGACACA 


TTTTAGAT7G 


TATCTAAACT 


TAAATGTATC 


TAGAATACAT 


650 


50 


ATATTTGAAT 


GCATCATATA 


CATSTATCCG 


ACACACCAAT 


TCTCATAAAA 


700 


AACGTAATAT 


CCTAAACTAA 


TTTATCCTIC 


AAGTCAACTT 


AAGCCCAATA 


750 




TACATTTTCA 


TCTCTAAAGG 


CCCAAGTGGC 


ACAAAATGTC 


AGGCCCAATT 


800 
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ACGAAGAAAA GGGCTTGTAA AACCCTAATA AAGTGGCACT GGCAGAGCTT 850 
ACACTCTCAT TCCATCAACA AAGAAACCCT AAAAGCCGCA GCGCCACTGA 900 

5 

Tl ' lC T C TCC T CCAGGCGAAG ATS CAG ATC TIC GTG AAG ACC TTA 944 

Met Gin He Phe Val Lys Thr Leu 
1 5 

10 ACG GGG AAG ACG ATC ACC CTA GAG GTT GAG TCT TCC GAC ACC 986 
Thr Gly Lys Thr He Thr Leu Glu Val Glu Ser Ser Asp Thr 
10 15 20 

ATC GAC AAT GTC AAA GCC AAG ATC CAG GAC AAG GAA GGG ATT 1028 
15 He Asp Asn Val Lys -Ala Lys He Gin Asp Lys Glu Gly He 
25 30 35 

CCC CCA GAC CAG CAG CGT TTG ATT TTC GCC GGA AAG CAG CTT 1070 
Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys Gin Leu 
20 40 45 50 

GAG GAT GGT CGT ACT CTT GCC GAC TAG AAC ATC CAG AAG GAG 1112 
Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn He Gin Lys Glu 
55 60 

25 

TCA ACT CTC CAT CTC GTG CTC CGT CTC CGT GGT GGT GGA TCC 1154 
Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gly Ser 
65 70 75 



30 Parole 3 

(Vm sri-nr-r inn nf TTbicmitin-lv t ; r Pentirlp Fusion Plasmids With 
Pnlvribimiirin Promoter nnd Intron (Ubi7) 



Exemplary and preferred pUC19 and pCGN1547 plasmid vectors 
35 containing a potato ( Solan urn tuberosum) polyubiquitin promoter and 
intron (Ubi7)', a region coding for a ubiquitin polypeptide, and a 
gene coding for a lytic peptide are constructed. 

To obtain the genomic clone containing a polyubiquitin 
promoter, intron and ubiquitin polypeptide coding region, a XFIXII 

40 potato genomic library was first prescreened using PCS. as 

described in Example 2 above. The PCR primers are homologous to 
regions of the polyubiquitin cDMA (see Garbarino J., ee al., Plant 
Molecular Biology 20: 235(1992)). A primer homologous to the 5' 
untranslated region of ubiquitin in the polyubiquitin cDNA and a 

45 primer complementary to the amino terminus of the ubiquitin coding 
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region in the polyubiquitin cDNA are used. A genomic clone 
containing both the polyubiquitin promoter region, intron, and the 
polyubiquitin coding region was identified. 

A chimeric gene is then constructed with a portion of the 
5 potato polyubiquitin genomic clone ligated to a lytic peptide 
gene, as described in Example 2. PCR is used to generate the 
Ubi7 -ubiquitin portion of the chimeric gene. The Obi 7 promoter 
region includes the 1220 bp p r omoter and 568 bp intron upstream of 
the ubiquitin ATG, and the ubiquitin polypeptide coding region is 

10 228 bp plus 6 bp of a BamHX restriction site (SEQ ID NO. 96) . 
This plasmid is designated pUCUbi7. 

A nucleotide fragment coding for the lytic peptide 
(corresponding to the amino acid sequence SEQ ID NO. 98) is 
generated as described in Example 2. The resulting Ubi7 

15 ubiquitin- lytic peptide fusion gene construct corresponds to SEQ 
ID NO. 95. Unlike previous cloning attempts using the CaMV35S 
promoter as described in the Background section, the sequence does 
not reveal any point mutations in the lytic peptide sub-clones. 
The plasmid was stable in the E. calx host and did not inhibit its 

20 growth. 

The resulting pUC19 recombinant plasmid is shown in the 
plasmid map in Figure 2. The sequence for the PCR insert 
containing the polyubiquitin promoter, intron, and the ubiquitin 
coding region corresponds to SEQ ID NO. 96 in Table 3 below. The 

25 sequence for the chimeric Ubi7 ubiquitin-lytic peptide fusion gene 
construct corresponds to SEQ ID NO. 95 in Table 3 below. This 
plasmid is designated as pUCUbi7-LP98. 

The entire Ubi7 ubiquitin-lytic peptide fusion gene 
construct, including the polyadenylation site, is isolated from 

30 pUC19 as an Asp7 IB /partial HindXII restriction fragment (the 
intron has an internal HindXII site) and sub-cloned into the 
PCGN1547 Agrobacterium vector for use in plant transformation. 
This plasmid is designated pCGNUbi7 -LP98 . 
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ThBLM 3: HOGLSOTXDB SBQOBSCB OF POTATO POIiTTTBIQ niTlH FROHQTtt 
HSOZOH (OBT7) JUD> OBXQOZTZN COS3ZRJ RBQXON ZBSZRT, AHD OBIQ OITHI- 
IRIC PBPTZSB FUSION QBB CUNS'XKUUT 



5 


SEO TD WV 


.25 












'l'l'I'ATCAATC 


AGATTTCAAC 


ATATAAATAA 


ATATAAATTG 


TCTCAATAAT 


50 




TCTACATTAA 


ACTAATATTT 


GAAATCTCAA 


TTTTATGATT 


TTTTAAATIC 


100 


10 


ACTTTATATC 


CAAGACAATT 


TNCAWTTTCA 


AAAAGTTITA 


TTAAAHACTT 


150 




ACATTAGTTT 


TGTTOATGAG 


GATGACAAGA 


TNTTCGTCAT 


CAATTACATA 


200 


15 


TACCCAAATT 


GAATAGTAAG 


CAACTTCAAT 


C"m*HUATA 


ATGATAATGA 


250 


CAGACACAAH 


NNAAACCCAT 


TTATTATTCA 


CATTGATTGA 


GTTTTATATG 


300 




CAATATAGTA 


ATAATAATAA 


TATITCTTAT 


AAAGCAAGAG 


GTCAATTTTT 


350 


20 


TTTTAATPAT 


ACCACGTCAC 


TAAATTATAT 


TTGATAA'IGT 


AAAACAATTC 


400 




AAATTTIACT 


TAAATATCAT 


GAAATAAACT 


ATTTTTATAA 


CCAAATTACT 


450 


25 


AAATTTTTCC 


AATAAAAAAA 


AGTCATTAAG 


AAGACATAAA 


ATAAATTTGA 


500 


GGTAAANGAG 


TGAAGTCGAC 


TGAtrrmTr 


TTTnTTATC 


ATAAGAAAAT 


550 




AAATTATTAA 


CTTEAACCTA 


ATAAAACACT 


AATATAA1TT 


CATGGAATCT 


600 


30 


AATACTTACC 


TCTTAGAAAT 


AAGAAAAAGT 


GTTTCTAATA 


GACCCTCAAT 


650 




TTACATTAAA 


TATTTOCAAT 


CAAATTTAAA 


TAACAAATAT 


CAATATGAGG 


700 


35 


TCAATAACAA 


TATCAAAATA 


ATATGAAAAA 


AGAGCAATAC 


ATAATATAAG 


750 


GGACGATTTA 


AGTGCGATEA 


TCAAGGTAGT 


ATTATATCCT 


AATTTCCTAA 


800 




TATTTGNGCT 


CEEATATTTA 


AGGTCATGTT 


CATGATAAAC 


TTGAAATGCG 


850 


40 


CTATATTAGA 


GCATATATTA 


AAATAAAAAA 


ATACCTAAAA 


TAAAATTAAG 


900 




TTATTTTTAG 


TATATATTTT 


TTTACATGAC 


CTACATTTTT 


CTGGGTTnT 


950 


45 


CTAAAGGAGC 
ATAAAAATTA 


GTGTAAGTGT 


CGACCTCATT 


CTCCTAATTT 


TCCCCACCAC 


1000 




AAAAGGAAAG 


GTAGCmTG 




TGGTACACTA 


1050 




CACCTCATTA 


TTACACGTGT 


CCTCATATAA 


TTGGTTAACC 


CTATGAGGCG 


1100 


50 


GTTTCGTCTA 


GAGTCGGCCA 


TGCCATCTAT 


AAAATGAAGC 


•rrrcTGCACc 


1150 




TCATTTTTTT 


CATCTTCTAT 


CTGATTTCTA 


TTATAATTTC 


TCTCAATTGC 


1200 
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10 



IS 



CTTCAAATTT CTCTTTAAGG TTAGAATCTT C T C T A ' m T T 1240 

GGTTTTTGTA TGTTTAGATT CTCGAATTAG CTAATCAGGC GCTGTTATAG 1290 

CCCTTCCTTT TGAGTCTCTC CTCGGTTGTC TTGATGGAAA AGGCCTAACA 1340 

TTTGAGTTTT TTTACGTCTC GTTTGATGGA AAAGGCCTAC AATTGGCCGT 1390 

TTTCCCCGTT CGTTTTGATG AAAAAGCCCC TAGTTTGAGA TTTTTTTTCT 1440 

GTCGTTCGTT CTAAAGGTTT AAAATTAGAG TTTTTACATT TCTTTCATGA 1490 

AAAAGGCCTT AAATTTCAGT TTTTCCGGTT GATTTGATGA AAAAGCCCTA 1540 

GAATTTGTGT TTTTCCGTCG GTTTGATTCT GAAGGCCTAA AATTTGAGTT 1590 

TCTCCGGCTG TTTTGATGAA AAAGCCCTAA ATTIGAGTTT CTCCGGCTCT 1640 

20 TTTGATGAAA AAGCCCTAAA TTTGAAGTTT TTTCCCCGTG TTTTAGATTG 1690 

TTTAGGTTTT AATTCTCGAA TCAGCTAATC AGGGAGTGTG AAAGCCCTAA 1740 

ATTGAAGTTT TTTTCGTTGT TCTGATTGTT GTTTTTATGA ATTTGCAG 1788 

215 

ATG CAG ATC TTT GTG AAA ACT CTC ACC GGA AAG ACT ATC ACC 1830 
Met Gin He Phe Val Lys Thr Leu Thr Gly Lys Thr He Thx 
15 10 

30 CTA GAG GTG GAA ACT TCT GAT ACA ATC GAC AAC GTT AAG GCT 1872 
Leu Glu Val Glu Ser Ser Asp Thr He Asp Asn Val Lys Ala 
15 20 25 

AAG ATC CAG GAT AAG GAA GGA ATT CCC CCG GAT CAG CAA AGG 1914 
35 Lys He Glu Asp Lys Glu Gly He Pro Pro Asp Gin Gin Arg 
30 35 40 

CTT ATC TTC GCC GGA AAG CAG TTG GAG GAC GGA CGT ACT CTA 1956 
Leu He Phe Ala Gly Lys Gin Leu Glu Asp Gly Arg Thr Leu 
40 45 ' 50 55 

GCT GAT TAC AAC ATC CAG AAG GAG TCT ACC CTC CAT TTG GTG 1998 
Ala Asp Tyr Asn He Gin Lys Glu Ser Thr Leu His Leu Val 
60 65 70 

45 

CTC CGT CTA CGT GGA GGT GGA TCC GCT GTT AAA AGA GTG GGT 2040 
Leu Arg Leu Arg Gly Gly Gly Ser Ala Val Lys Arg Val Gly 
75 80 

50 CGT AGG TTG AAA AAG TTG GAC CGT AAG ATT GAT AGG TTA GGA 2082 
Arg Arg Leu Lys Lys Leu Asp Arg Lys He Asp Arg Leu Gly 
85 90 95 
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GTT GAT TTT TCATCTAGAG TCGACCGATC CCCCGAATTT CCCCGA 2127 
Val Asp Phe 
IPO 



5 


SEO ID NO Ofi 
TTTATCAATC AGATTTSAAC 


ATATAAATAA 


ATATAAATTG 


TCTCAATAAT 


50 




TCTACA'IM'AA 




GAAA1CTCAA 


TTTTAT3ATT 


TTTTAAATTC 


100 


10 


ACTTTATATC 




TNCANCTTCA 


AAAAGTTTXA 


TTRAAMATTT - 


150 




ACATTAGTW 




GATGACAAGA 


TOTTGGTCAT 


CAATTACATA 


200 


1 c 


TACCCAAATT 


UnninuinAlj 


CAACTTCAAT 


GTTTTTCATA 


ATCATAATGA 


250 


CAGACACAAN 


NNAAAf^PPVt* 


TTATTATTCA 


CATTGATTOA 


GTTTTATATG 


300 




CAATATACTA 


ATAATAATAA 


TATTTCTTAT 


AAAGCAAGAG 


GTCAATTnT 


350 


20 


TTTTAATTAT 




TAAATTATAT 


TTGATAATGT 


AAAACAATIC 


400 




AAATTTTACT 


TAAATATCAT 


GAAATAAACT 


ATTTTEATAA 


CCAAATTACT 


450 


25 


AAATTTTTCC 


AATAAAAAAA 


AGTCATTAAG 


AAGACATAAA 


ATAAATTTGA 


500 




lUUtta J/UGAC 


TGAcmrrrr 


TTTOTTATC 


ATRAGAAAAT 


550 




AAATTATTAA 


CTTTAACCTA 


ATAAAACACT 


AATATAATTT 


CATGGAATCT 


600 


30 


AATACTTACC 


7CTTAGAAAT 


AAGAAAAAGT 


GTTTCTAATA 


GACCCTCAAT 


650 




TTACATTAAA 


TATTTTCAAT 


CAAATTTAAA 


TAACAAATAT 


CAATATGAGG 


700 


35 


TCAATAACAA 


TATCAAAATA 


ATATGAAAAA 


AGAGCAATAC 


ATAATATAAG 


750 


GGACGATTTA 


AGTGCGATTA 


TCAAGGTAGT 


ATE&TATCCT 


AATTTCCTAA 


800 




TATTTGMGCT 


CTTATATITA 


AGGTCATGTT 


CATGATAAAC 


TTGAAATGCG 


850 


40 


CTATATTAGA 


GCATATATTA 


AAATAAAAAA 


ATACCTAAAA 


TAAAATTAAG 


900 




TTATTTITAG 


TATATA'I'lVr 


TTTACATGAC 


CTACATXTIT 


CTCGGTTTTT 


950 


45 


CTAAAGGAGC 
ATAAAAATTA 


GTGTAAGTGT 
AAAAGGAAAG 


CGACCTCATT 


CTCCTAATTT 


TCCCCACCAC 


1000 




GTAGCTTTTG 




TGGTACACTA 


1050 




CACCTCATTA 


TTACACGTCT 


CCTCATATAA 


TTGGTTAACC 


CTATGAGGCG 


1100 


50 


GTTTCGTCTA 


GAGTCGGCCA 


TGCCATCTAT 


AAAATGAAGC 


TTTCTGCACC 


1150 




TCATITTCTT 


CATCTTCTAT 


CTGATTTCTA 


TTATAATTTC 


TCTCAATTSC 


1200 
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CTTCAAATTT CTCTTTAAGG TTAGAATCTT CTCTATTTTT 

G GTTTTTGT A TGTTTAGATT CTCGAATTAG CTAATCAGGC GCTGTTATAG 

CC CTTCCTTT TGA GTCTCTC CTCGGTTGTC TTGATGGAAA AGGCCTAACA 

TITGAGTTTT TTTACGTCTG GTTTGATGGA AAAGGCCTAC AATTGGCCGT 

TTTCCCCGTT CGTTTTGATG AAAAAGCCCC TAGTTTGAGA TTITTTTTCT 

GTCGTTCGTT CTAAAGGTTT AAAATTAGAG TTTTTACATT TGTTTGATGA 

AAAAGGCCTT AAATTIGAGT TTTTCCGGTT GATTTGATGA AAAAGCCCTA 

GAA TTTG T GT TTTTCCGTEG GTTTGATTCT GAAGGCCTAA AATTTGAGTT 

TCTCCGGCTG TTTTGATGAA AAAGCCCTAA ATTTGAGTTT CTCCGGCTGT 

TTTGATCAAA AAGCCCTAAA TTTGAAGTTT TTTCCCCGTG TTTTAGATTG 

TTTAGGTTTT AATTCTCGAA TCAGCTAATC AGGGAGTGTG AAAGCCCTAA 

ATTGAAGTTT VmUJl ' lU T TCTGATTGTT GTTTTTATGA ATTTGCAG 

ATG CAG ATC ITT GTG AAA ACT CTC ACC GGA AAG ACT ATC ACC 
Met Gin He Phe Val Lys Thr Leu Thr Gly Lys Thr He Tnr 
1 5 10 

CTA GAG GTG GAA ACT TCT GAT ACA ATC GAC AAC GTT AAG GCT 
Leu Glu val Glu Ser Ser Asp Thr He Asp Asn Val Lys Ala 
15 20 25 

AAG ATC CAG GAT AAG GAA GGA ATT CCC CCG GAT CAG CAA AGG 
Lys He Glu Asp Lys Glu Gly He Pro Pro Asp Gin Gin Arg 
30 35 4° 

CTT ATC TTC GCC GGA AAG CAG TTG GAG GAC GGA CGT ACT CTA 
Leu He Phe Ala Gly Lys Gin Leu Glu Asp Gly Arg Thr Leu 
45 50 55 

GCT GAT TAC AAC ATC CAG AAG GAG TCT ACC CTC CAT TTG GTG 
Ala Asp Tyr Asn He Gin Lya Glu Ser Thr Leu His Leu Val 
60 65 '0 

45 CTC CGT CTA CGT GGA GGT GGA TCC 
Leu Arg Leu Arg Gly Gly Gly Ser 
7S 



10 



15 



20 



25 



30 



35 



40 



1240 
1290 
1340 
1390 
1440 
1490 
1540 
1590 
1640 
1690 
1740 
1788 
1830 

1B72 

1914 

1956 

1998 



2022 



42 

SUBSTITUTE SHEET (RULE 26) 



WO 96/03519 



PCT7US95/09339 



ConarniCfinn nf TThi qui tin -Lytic VmriAf FusS^ n Rmn PTamrirt 

vectora 

5 pOC19 and pCGN1547 pla sm id vectors containing a potato 

(Solaaum tuberosum) Ubi3 promoter, a region coding for a ubiquitin 
polypeptide, and a gene coding for a lytic peptide are constructed 
according to Example 2 . Each plasmid respectively contains one 
lytic peptide nucleotide sequence coding for an minn acid 

10 sequence corresponding to SEQ ID NO. l, 7, 15, 21, 30, 39. 43, 52, 
83, 86, 88, 90, and 91. The resultant pOC19 Ubi3 ubiquitin-lytic 
peptide recombinant plasmids are designated as follows: pUCUbi3- 
LPl, pOCUbi3-LP7, pUC0bi3-LP15, pUCUbi3-LP21, pUOIbi3-LP30, 
pUCUbi3-LP39, pUCObi3-LP43, pUCUbi3 -LP52 , pUCUbi3-LP83, pUCUbi3- 

15 LP86, pUOTbi3-LP88, pUCObi3-LP90, and pUCDbi3-LP91. The resultant 
PCGN1547 Ubi3 ubiquitin-lytic peptide recombinant plasmids are 
designated as follows: pCGNUbi3-LPl, pCGNObi3-LP7, pCGN0bi3-LP15, 
pCGNUbi3-LP21, DCGNUbi3-LP30, pCGNUbi3-LP39. pCGNUbi3-LP43. 
pCGNObi3-LP52, pCGNDbi3-LP83, pCGNDbi3-LP86, pCGNObi3-LP88, 

20 pCGNObi3-LP90, and pCGNObi3 -LP91 . 

PUC19 and pCGN1547 plasmid vectors containing a potato 
(Solaaum tuberosum) Ubi7 promoter and intron,' a region coding for 
a ubiquitin polypeptide, and a gene coding for a lytic peptide are 
constructed according to Example 3 . Each plasmid respectively 

25 contains one lytic peptide nucleotide sequence coding for an "wip" 
acid sequence corresponding to SEQ ID NO. 1, 7, 15, 21, 30, 39, 
43, 52, 83, 85, 88, 90, and 91. The resultant pUC19 Obi7 
ubiquitin-lytic peptide recombinant plasmids are designated as 
follows: pOCObi7-LPl, pUOJbi7-LP7, pDC0bi7-LP15, pUCUbi7-LP21, 

30 pUCUbi7-LP30, pOCDbi7-LP39, pUCUbi7-t,P43 , pUCUbi7-LP52, pOCUbi7- 
LP83, pOCUbi7-LP86, pUCDbi7-LP88, ptJCUbi7-LP90, and pUCUbi7-LP91. 
The resultant pCGN1547 0bi7 ubiquitin-lytic peptide recombinant 
plasmids are designated as follows: pCGNUbi7-LPl, pCGNObi7-LP7, 
pCGN0bi7-LP15, pCGNUbi7-LP21, pCGNObi7-LP30, pCGNCbi7-LP39, 

35 pCGN0bi7-LP43, pCGNUbi7-LP52, pCGNUbi7-LP83 , pCGNDbi7-LP86, 
pCGNObi7-LP88, pCGNDbi7-LP90, and pCGN0bi7-LP91 . 

43 

SUBSTITUTE SHEET (RULE 26) 



WO 96/03519 



PCT/US95/09339 



BtamplB.S 

Conarnirtion of nnS-lTbimiir.jn Fiia-inn n»nP. B.rnnihi nm t; nwa 
Molecules and nbiouifcin PrmwtPT-flns Rocrrmhi nurif HW Mnlsnilw 

5 

Two chimeric genes containing a S-glucuronidase (GUS) 
reporter gene and the Ubi3 promoter were constructed inpCGOT.547 
plasmid vectors according to Garbarino, J., and Belknap, w.. Plant 
Molecular Biology 24: 119 (1994), hereby incorporated by reference 

10 in its entirety. The first vector contains the 920 bp Ubi3 

promoter ligated to the GOS gene, and expresses the GUS protein. 
This plasmid is designated pCGN0bi3 -GOS. The second vector 
contains the 920 bp Ubi3 promoter and 228 bp ubiquitin coding 
region ligated in frame to the GOS gene. This plasmid expresses a 

15 ubiquitin -GOS fusion polypeptide. This plasmid is designated 
pCGNUbi 3 -GUS f . 

Two chimeric genes containing a S-glucuronidase (GUS) 
reporter gene and the Ubi7 promoter minus the intron region were 
constructed in pCGN1547 plasmid vectors using PCR, as described in 

20 Example 3 and in Garbarino, J., and Belknap, W. , Plant Molecular 
Biology 24: 119 (1994). The first vector contains a 1156 bp 0bi7 
promoter region insert, including the 5' untranslated region of 
ubiquitin, ligated to the GOS gene. This plasmid does not contain 
the Ubi7 intron and expresses the GOS protein. This plasmid is 

25 designated pCGNObi7-GOS. The second vector contains the 1156 Ubi7 
ubiquitin promoter from pCGNObi7-GOS and the 228 bp ubiquitin 
coding region' fused in frame to the GOS reporter gene . This 
plasmid expresses a ubiquitin -GUS fusion polypeptide and is 
designated pCGNObi7 -GOSf . 

30 

£2BBDDlfi_i 

Plant Transformation and CTTS Gene Expression 

The chimeric plasmids pCGNUbi3-GOS, pCGN0bi3-GOSf , pCGN0bi7- 
35 GUS, and pCGNUbi7-GUSf from. Example 5 are introduced into the 
potato (Sol an urn tuberosum) using Agrobacteriujn mediated 
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transformation according to Garbarino, J., and Belknap, w. Plant 
Molecular Biology 24:119 (1994). The strain of Agrabacterium 
tumefaciens used for transformation (PC2760. see An, G. , et al., 
EMBO J. 4: 277 (1985)) harbors the disarmed Ti plasmid pAL4404 
5 (see Hoekema. A., et al., Nature 303: 179 (1983). Plant 

transformation is carried out as previously described in Synder, 
G.W., et al.. Plant Cell Rep 12:324 (1993), except that- 1 rag/1 
silver thiosulfate is added to the stage II transformation medium 
(see Chang, H.H., et al., Bot Bull Acad Sci 32: 63 (1991). 

10 expression of the ubiquitin-GOS fusion polypeptide and mRNA 

products and the GOS protein -alone is examined by northern and 
western analysis, as described in Garbarino J., and Belknap, W., 
Plant Molecular Biology 24: 119 (1994). GOS protein expression is 
examined in the transgenic plants using western analysis. 

15 Although there is a wide range of activity among individual 

clones, the ubiquitin-GUS fusion polypeptide containing plants 
generally give 5-10 fold higher expression than the plants 
containing GOS protein alone. This higher level of protein 
expression corresponds to similarly elevated mRNA transcription 

20 levels for the ubiquitin-GUS fusion constructs, as shown by 

northern analysis (described in Garbarino et al.. Plant Molecular 
Biology 24: 119 (1994)). Western analysis also shows that the 
ubiquitin-GDS fusion polypeptide was appropriately processed by 
endogenous ubiquitin hydrolases to produce free GOS protein. 

25 GOS protein activity is measured as described by Jefferson, 

R.A., et al., OCBO J. 6: 3901 (1987). Table 4 below shows a 
comparison of, the GOS activities in plants transformed with 
pCGHDbi3-GOS (ubi-) and plants transformed with pCGNObi3-GOSf 
(ubi+) . The activity is measured in nmoles methyl umbellif eron 

30 (MO) per minute per milligram of protein. Methyl umbelliferon is 
the fluorescent product of the GOS enzymatic reaction. 
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caratruct 




stein) 






iMt 1 


2nd Ltaf 


5th Leaf 


Senescent 


Tuber 




6.31±0.74 


2.51±0.52 


1. 79*0.22 


5.42±1.24 


3.26±0.27 


8.1-ubi 


25.8±2.08 


9.98±2.10 


6.34±1.00 


19.20±6.11 


14.2±1.6 


3 . 5f ubi 


94.8±12.6 


60.3±25.1 


32.7±8.71 


50.1±11.6 


37.6±10.4 




33.3±0.5 


18.9±2.75 


9.74±0.99 


22.7±3.57 


20.7*3.45 



5 

TUtnrnnlfi 7. 



The chimeric plasmids pCGNUbi3-LP98 from Example 2 and 
10 pCGNUbi7-I»P9B from Example 3 are introduced inco the potato 

(Solanum tuberosum) using Agrobacteriuni mediated transformation 
according to Garbarino, J., and Belknap. W. Plant Molecular 
Biology 24:119 (1994) . The strain of Agrobacterimn eumefaciens 
used for transformation (PC2760, see An, G. , et al., EMBO J. 4: 
15 277 (1985)) harbors the disarmed Ti plastmid pAL4404 (see Hoekema, 
A., et al.. Nature 303: 179 (19B3) . Plant transformation is 
carried out as. previously described in Syhder, G.W., ec Plant 
Cell Rep 12:324 (1993). except that 1 mg/1 silver thiosulfate is 
added to the stage II transformation medium (see Chang, H.H., et 
20 al., Bot Bull Acad Sci 32: 63 (1991). 

Expression of the ubiquitin-lytic peptide fusion polypeptide 
a nd mPNA products is examined by northern and western analysis , ai 
described in Example 6 and Garbarino J., and Belknap, w.. Plant 
Molecular Biology 24: 119 (1994). Northern analysis shows that 
25 ubiquitin-lytic peptide mRNA is transcribed from the gene 

construct in the transgenic plants. Western analysis shows that 
the ubiquitin-lytic peptide fusion polypeptide is appropriately 
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processed by endogenous ubiquitin hydrolases to produce free lytic 
peptide. 

5 Cloned Ohi1/Hbi7 Promoter Activity 

mRNA expression frost the cloned Ubi3 promoter was examined 
before and after wounding to determine if the cloned Ubi3 promoter 
is wound inducible in transformed plants (see Garbarino, J. and 

10 Belknap, W., Plant Molecular Biology 24:119 (1994)). Northern 
analysis comparing endogenous Dbi3 mRNA expression levels to 
pCGNUbi3-GUS and pCGNUbi3 -GUSf mRNA expression levels in 
transformed plants (see Example 5) shows that while the endogenous 
Ubi3 mRNA transcription increases upon wounding, transcription 

15 from the recombinant Ubi3 plasmids does not. Thus the recombinant 
Ubi3 promoter does not have the wound inducible characteristic of 
the endogenous Ubi3 promoter. This result suggests that the 920 
bp of upstream sequence cloned in the Ubi3 genomic clone is not 
sufficient to obtain wound-dependent activation of the promoter. 

20 The promoter instead is constitutive, however, it still 

demonstrates developmental regulation, as shown in Table 4 above. 

In contrast, the cloned Ubi7 promoter retains its wound- 
dependent induction. Northern analysis comparing the endogenous 
Ubi7 mRNA expression levels to the expression levels from 

25 pCGNObi7-GUS and pCGNUbi7-GtTSf in transformed plants (see Example 
5) shows that both the endogenous and the cloned Ubi7 promoter 
have wound-dependent activation. 
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DBPOSZT XHFaRM&TZOH 

E. cali cultures, each respectively transformed with pUC0bi7- 
5 LP98 (Local Accession No. POT-0273). pUCUbi3-LP98 (Local Accession 
No. PBT-0276), pTJC0bi7 (Local Accession No. PBT-0277) , and pOCObi3 
(Local Accession No. PBT-0234) were deposited in the Agricultural 
Research Service Culture Collection (NHRL) . The depository is 
located at located at 1815 North University Street. Peoria, 1L, 
10 61604. 



48 

SUBSTITUTE SHEET (RULE 26) 



WO 96/03519 



PCT/US95/09339 



SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: DEMETER BIOTECHNOLOGIES , LTD. 

(B) STREET: 905 W. MAIN ST., BRIGHTLEAF SQUARE STE 1S-D 

(C) CITY: DURHAM 

(D) STATE: NORTH CAROLINA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 27701 

(G) TELEPHONE: (919)682-7181 

(H) TELEFAX: (919)682-8340 

(i) APPLICANT: 

(A) NAME: UNITED STATES OF AMERICA, AS REPRESENTED BY 

THE SECRETARY OF AGRICULTURE 

(B) STREET: 

(C) CITY: WASHINTON 

(D) STATE: D. C. 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 20250-1400 

(G) TELEPHONE: 

(H) TELEFAX: 

(ii) TITLE OF INVENTION: UBIQUITIN-LYTIC PEPTIDE FUSION GENE 
CONSTRUCTS , PROTEIN PRODUCTS DERIVING THEREFROM , AND 
METHODS OF MAKING AND USING THE SAME 

(iii) NUMBER OF SEQUENCES : 98 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC coiroatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WORDPERFECT 5.1+ 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 21-JUL-1994 

(vi) PRIOR APPLICATION DATA: 08/279, 472 

(A) APPLICATION NUMBER: 08/279,472 

(B) FILING DATE: 22-JUL-1994 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
15 10 15 

Lys Ala Val Lys Lys Ala Val Lys Lys Lys Lye 
20 25 



(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL : NO 

(v) • FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Lys Lys Ala 
15 10 15 

Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala Val Lys Lys Lys Lys 
20 25 30 
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(4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 37 

(B) TYPE: AMINO ACID 
<C> TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

Phe Ala Val Ala Val Lys Ala Val Ala Val Lye Ala Val Ala Val Lya 
15 10 15 

Ala Val Lys Lys Ala Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala 
20 25 30 

Val Lys Lys Lys Lys 
35 

(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
15 10 15 

Lys Ala Val Lys Lys Ala Val 
20 
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(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 

(B) TYPE: AMINO ACID 
tC) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
( i i i ) HYPOTHETICAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

Pbe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Lys Lye Ala 
15 10 15 

Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala Val 
20 25 

(7) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Ala Val Lys 
15 10 15 

Ala Val Lys Lys Ala Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala 
20 25 30 

Val 
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(8) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 7 

Phe Ala Val Gly Leu Arg Ala lie Lye Arg Ala Leu Lys Lys Leu Arg 
1 5 10 15 

Arg Gly Val Arg Lys Val Ala Lys Arg Lys Arg 
20 25 

(9) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

Phe Ala Val Gly Leu Arg Ala lie Lys Arg Ala Leu Lys Lys Leu Arg 
1 5 10 15 

Arg Gly Val Arg Lys Val Ala 
20 
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(10) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

( vi i ) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
1 5 * 10 15 

Ala Arg Lys He Ala Arg Leu Gly Val Ala Phe 
20 25 

(11) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Ala Arg Lys He 
15. 10 15 

Ala Arg Leu Gly Val Ala Phe 
20 
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(12) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(Vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

Phe Ala Val Gly Leu Arg Ala lie Lys Arg Ala Leu Lys Lys Leu Arg 
1 5 10 15 

Arg Gly Val Arg Lys Val Ala Lys Arg Lys Arg Lys Lys Asp Leu 
20 25 30 



(13) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(Vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

Phe Ala Val Gly Leu Arg Ala lie Lys Arg Ala Leu Lys Lys Leu Arg 
1 5 10 15 

Arg Gly Val Arg Lys Val Ala Lys Asp Leu 
20 25 
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(14) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE : 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X> PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
15 10 15 

Ala Arg Lys He Ala Arg Leu Gly Val Ala Phe Lys Asp Leu 
20 25 30 

(15) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE : SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Ala Arg Lys He 
1 " 5 10 15 

Ala Arg Leu Gly Val Ala Phe Lys Asp Leu 
20 25 
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(16) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 
(ii> MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lya Val 
15 10 15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
20 25 

(17) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
(Iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
15 10 15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Val 
20 25 30 
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(18) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

Lys Lye Lye Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
15 10 15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Lys 
20 25 30 

Val Ala Val Ala Val 
35 

(19) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
<vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val Ala Lys Lys Val 

1 5 10 15 

Ala Lys Val Ala Val Ala Val 
20 
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(20) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

( iii ) HYPOTHETICAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(Vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

Phe Val Lys Lye Val Ala Lys Lye Val Lys Lye Val Ala Lys Lys Val 
15 10 15 

Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Val 
20 25 

(21) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY : LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val Ala Lys Lys Val 
15 10 15 

Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala 
20 25 30 

Val 
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(22) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

Lye Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lye Val 
1 5 10 15 

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val 
20 25 

(23) INFORMATION FOR SEQ ID NO: 22: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) - FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 
15 10 15 

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 
20 25 30 
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(24) INFORMATION FOR SEQ ID NOi 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE : COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 
l 5 * 10 15 

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 
20 25 30 

Lys Val Ala Lys Lys 
35 

(25) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: . 24 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1 5 10 15 

Lys Lys Val Ala Lys Lys Val 
20 



51 

SUBSTITUTE SHEET (RULE 26) 



WO 96/03519 



PCT/US95/09339 



(26) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
( i i i ) HYPOTHETICAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
15 10 15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 
20 * 25 

(27) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
( i i i ) HYPOTHETICAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
15 10 15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Val Ala Lys 
20 25 30 

Lys 
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(28) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 
<ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
( i ii ) HYPOTHETICAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: . 27 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
15 10 15 

Lys Lys Val Ala Lys Lys Val Lys Lys Lys Lys 
20 25 

(29) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii ) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
15 10 15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Lys Lys 
20 25 30 
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(30) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
( i i i ) HYPOTHETICAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(Vii) IMMEDIATE SOURCE: SYNTHETIC 

<x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
15 10 IS 

Lys Lye Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Val Ala Lys 
20 " 25 30 

Lys Lys Lys Lys Lys 
35 

(31) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

<v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 



Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Lys Lys Lys Lys 
1 5 10 15 
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(32) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

Phe Lys Val Lye Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 15 

Ala Lys Lys Lys Lys 
20 

(33) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v> ' FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
15 10 15 

Ala Lys Val Lys Ala Lys Val Lys Lys Lys Lys 
20 25 
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(34) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 

(35) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 15 

Ala 
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(3 6) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE : AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

Phe Lye Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lye Val Lys 
1 5 10 15 

Ala Lys Val Lys Ala Lys Val 
20 

(37) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 
(vii ) IMMEDIATE SOURCE : SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 15 
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(38) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY : LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
( i 2 i ) HYPOTHETICAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

Lys Lys Lys Lys Pbe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lye 
15 10 15 

Ala Lys Val Lys Ala 
20 

(39) INFORMATION FOR SEQ ID NO : 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 

Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 " 10 15 

Ala Lys Val Ly6 Ala Lys Val Lys Ala Lys Val 
20 25 
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(40) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

Phe Lys Lys Val Lys Lys Val Ala Lys Lys Val Cys Lys Cys Val Lys 
15 10 15 

Lys Ala Val Lys Lys Val Lys Lys Phe 
20 25 

(41) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
15 10 15 

Lys Ala Val Lys Lys Ala Val Cys Cys Cys Cys 
20 25 
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(42) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
<vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 

Cys Cys Cys Cys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
15 10 15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
20 25 

(43) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 42 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
15 10 15 

Lys Ala Val Lys Lys Ala Val Ser Ser Ser Ser 
20 25 
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(44) INFORMATION FOR SEQ ID NO:' 43: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(Vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

Ser Ser Ser Ser Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
l 5 10 15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
20 25 

(45) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE : AMINO ACID 

(C) TOPOLOGY : LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 

Phe Ala Leu Ala Leu Lys Ala Leu Lys Lys Ala Leu Lys Lys Leu Lys 
1 5 10 15 

Lys Ala Leu Lys Lys Ala Leu 
20 
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(46) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(Vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 45 

Leu Ala Lys Lys Leu Ala Lys Lys Leu Lys Lys Leu Ala Lys Lys Leu 
1 5 10 15 

Ala Lys Leu Ala Leu Ala Phe 
20 

(47) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY : LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 

Phe Ala Phe Ala Phe Lys Ala Phe Lys Lys Ala Phe Lys Lys Phe Lys 
15 10 15 

Lys Ala Phe Lys Lys Ala Phe 
20 
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(48) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
(ili) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

Phe Ala He Ala He Lys Ala He Lys Lys Ala He Lys Lys He Lye 
15 10 15 

Lys Ala He Lys Lys Ala He 
20 

(49) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(Vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 

Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe Ala Lys Lys Phe 
1 * 5 " 10 15 

Ala Lys Phe Ala Phe Ala Phe 
20 
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(50) INFORMATION FOR SEQ ID NO : ' 4 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 

Phe Lys Arg Leu Ala Lye lie Lys val Leu Arg Leu Ala Lys lie Lye 
15 10 15 

Arg 

(51) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

Lys Leu Lys Leu Ala Val Lys Leu Val Gly Leu Leu Arg Lys Lys Arg 
15 10 15 

Ala Leu Lys lie Ala Leu Arg Gly Val Ala Lys Arg Ala Gly Arg Leu 
20 25 30 

Ala Val Arg Lys Phe 
35 
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(52) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii ) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 

Phe Ala Arg Ala Arg Lys Ala Arg Lys Lys Ala Arg Lys Lys Arg Lys 
15 10 15 

Lys Ala Arg Lys Lys Ala Arg Lys Asp Arg 
20 25 

(53) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) ' FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

Phe Ala Val Ala Val Cys Ala Val Cys Cys Ala Val Cys Cys Val Cys 
15 10 15 

Cys Ala Val Cys Cys Ala Val 
20 
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(54) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 

Phe Ala Val Ala Val Ser Ala Val Ser Ser Ala Val Ser Ser Val Ser 
15 10 15 

Ser Ala Val Ser Ser Ala Val 
20 

(55) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 

Phe Ala Val Ala Val Ser Ala Val Ser Ser Ala Val Ser Ser Val Ser 
15 10 15 

Ser Ala Val Ser Ser Ala Val Ser Ser Ser Ser 
20 25 



76 

SUBSTITUTE SHEET (RULE 26) 



WO 96/03519 



PCT/US95/09339 



(56) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE : AMINO ACID 

(C) TOPOLOGY : LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 
(vii ) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 



Phe Ala Lys Lys Phe Ala Lys Lys 
1 5 

Ala Lys Phe Ala Phe Ala Phe Lys 
20 



Phe Lys Lys Phe Ala Lys Lys Phe 
10 15 

Lys Lys Lys 
25 



(57) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii ) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 

Lys Lys Lys Lys Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe 
1 5 * 10 15 

Ala Lys Lys Phe Ala Lys Phe Ala Phe Ala Phe 
20 * 25 
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(58) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 

Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe Val Arg Lys Phe 
15 io 15 

lie Arg Phe Ala Phe Leu Phe 
20 

(59) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY : LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) • FRAGMENT TYPE : COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 

Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe Val Arg Lys Phe 
1*5 10 15 

lie Arg Phe Ala Phe Leu Phe Lys Arg Lys Arg 
20 25 
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(60) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE : 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL : NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 

Lys Arg Lys Arg Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe 
1 5 ' 10 15 

Val Arg Lys Phe He Arg Phe Ala Phe Leu Phe 
20 25 

(61) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 
(vii ) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 

He Ala Lys Lys He Ala Lys Lys He Lys Lys He Ala Lys Lys He 
15 10 15 

Ala Lys He Ala He Ala He 
20 
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(62) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 

lie Ala Lys Lys lie Ala Lys Lys lie Lys Lys lie Ala Lys Lya He 
1*5 10 15 

Ala Lye He Ala He Ala He Lys Lys Lys Lys 
20 25 

(63) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE : AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

Lys Lys Lys Lys He Ala Lys Lys lie Ala Lys Lys lie Lys Lys He 
1 5 10 15 

Ala Lys Lys He Ala Lys lie Ala lie Ala He 
20 25 
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(64) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE : AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 

lie Ala Arg Lys He Leu Lys Arg He Lys Lys He Val Arg Lys Phe 
15 10 15 

He Arg He Ala He Leu He 
20 

(65) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION : NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 

He Ala Arg Lys He Leu Lys Arg He Lys Lys He Val Arg Lys Phe 
1 5 10 15 

He Arg He Ala He Leu He Lys Arg Lys Arg 
20 25 
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(66) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

<A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

Lys Arg Lys Arg He Ala Arg Lys He Leu Lys Arg He Lys Lys He 
1 5 10 15 

Val Arg Lys Phe He Arg He Ala He Leu He 
20 25 

(67) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg Ala Lys He Lys 
1 5 10 15 

Leu 
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(68) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
(Vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys lie Lys Val Arg Leu Arg 
15 10 15 

Ala Lys lie Lys Leu 

20 

(69) INFORMATION FOR SEQ ID NO: 6B: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(Vii) IMMEDIATE SOURCE : SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6B 

Phe Lys Leu Arg Ala Lys lie Lys Val Arg Leu Arg Ala Lys lie Lys 
1 5 " 10 15 

Leu Lys Arg Lys Arg 
20 
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(70) INFORMATION FOR SEQ ID NO:' 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

( vi i ) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 

Phe Lys Leu Arg Ala Lys lie Lys Val Arg Leu Arg Ala Lys lie Lys 
1 5 10 15 

Leu Arg Val Lys Leu Lys lie 
20 

(71) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE : SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 

Phe Lys Leu Arg Ala Lys lie Lys Val Arg Leu Arg Ala Lys lie Lys 
1 5 10 15 

Leu Arg Val Lys Leu Lys lie Lys Arg Lys Arg 
20 25 
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(72) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY : LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
<vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 71 

Lys Arg Lys Arg Phe Lye Leu Arg Ala Lys He Lys Val Arg Leu Arg 
1 5 10 15 

Ala Lys He Lys Leu Arg Val Lys Leu Lys He 
20 25 

(73) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY : LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETI CAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 

Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg Ala Lys He Lys 
1 5 10 15 

Leu Arg Val Lys Leu Lys He Arg Ala Arg He Lys Leu 
20 25 
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(74) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
( i i i ) HYPOTHETICAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 

Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg Ala Lye He Lys 
1 5 10 15 

Leu Arg Val Lys Leu Lys He Arg Ala Arg He Lys Leu Lys Arg Lye 
20 25 30 

Arg 

(75) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys He Lys Val Arg Leu Arg 
1 5 10 15 

Ala Lys He Lys Leu Arg Val Lys Leu Lys He Arg Ala Arg He Lys 
20 25 30 

Leu 
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(76) INFORMATION FOR SEQ ID NO: 75: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 

Phe Lys Leu Arg Ala Lys lie Lys Val Arg Leu Arg Ala Lys lie Lys ' 
1 5 10 15 

Leu Val Phe Ala lie Leu Leu 
20 

(77) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 
(vi) ORIGINAL SOURCE: SYNTHETIC 
(Vii) IMMEDIATE SOURCE: SYNTHETIC 

<x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 

Phe Lys Leu Arg Ala Lys lie Lys Val Arg Leu Arg Ala Lys He Lys 
15 10 15 

Leu Val Phe Ala He Leu Leu Lys Arg Lys Arg 
20 25 
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(7B) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
<vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys lie Lys Val Arg Leu Arg 
15 10 15 

Ala Lys lie Lys Leu Val Phe Ala lie Leu Leu 
20 25 

(79) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
(iii (HYPOTHETICAL: NO 

(v) ' FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

Val Phe Ala He Leu Leu Phe Lys Leu Arg Ala Lys He Lys Val Arg 
15 10 15 

Leu Arg Ala Lys He Lys Leu 
20 
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(80) INFORMATION FOR SEQ ID NO: '79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 

Val Phe Ala lie Leu Leu Phe Lys Leu Arg Ala Lys He Lys Val Arg 
15 10 15 

Leu Arg Ala Lys He Lys Leu Lys Arg Lys Arg 
20 25 

(81) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 

<B) TYPE: AMINO ACID 
(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 80 

Lys Arg Lys Arg Val Phe Ala He Leu Leu Phe Lys Leu Arg Ala Lys 
15 10 15 

He Lys Val Arg Leu Arg Ala Lys He Lys Leu 

.20 25 
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(82) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v> FRAGMENT TYPE: COMPLETE PEPTIDE 
(vi) ORIGINAL SOURCE: SYNTHETIC 
( Vi i ) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 

Val Gly Glu Cys Val Arg Gly Arg Cys Pro Ser Gly Met Cys Cys Ser 
1 5 10 15 

Gin Phe Gly Tyr Cys Gly Lys Gly Pro Lys Tyr Cys Gly 
20 25 

(83) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 
<ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDB 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 82 

Val Gly Glu Cys Val Arg Gly Arg Cys Pro Ser Gly Met Cys Cys Ser 
1 5 10 15 

Gin Phe Gly Tyr Cys Gly Lys Gly Pro Lys Tyr Cys Gly Arg 
20 25 30 
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(84) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

( i ii ) HYPOTHETICAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 

Leu Gly Asp Cys Leu Lys Gly Lys Cys Pro Ser Gly Met Cys Cys Ser 
1 5 10 15 

Asn Tyr Gly Phe Cys Gly Arg Gly Pro Arg Phe Cys Gly Lys 
20 * 25 30 



(85) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 

Gin Cys lie Gly Asn Gly Gly Arg Cys Asn Glu Asn Val Gly Pro Pro 
1 5 " 10 15 

Tyr Cys Cys Ser Gly Phe Cys Leu Arg Gin Pro Gly Gin Gly Tyr Gly 
20 ' 25 30 

Tyr Cys Lys Asn Arg 
35 
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(86) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

<x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 

Cys lie Gly Asn Gly Gly Arg Cys Asn Glu Asn Val Gly Pro Pro Tyr 
1 5 10 15 

Cys Cys Ser Gly Phe Cys Leu Arg Gin Pro Asn Gin Gly Tyr Gly Val 
20 ' 25 30 

Cys Arg Asn Arg 
35 

(87) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE : 

(A) DESCRIPTION: PEPTIDE 

(iii ) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 

Cys lie Gly Gin Gly Gly Lys Cys Gin Asp Gin Leu Gly Pro Pro Phe 
1 5 10 15 

Cys Cys Ser Gly Tyr Cys Val Lys Asn Pro Gin Asn Gly Phe Gly Leu 
20 25 30 

Cys Lys Gin Lys 
35 
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(88) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii ) HYPOTHETICAL : NO 

<v) FRAGMENT TYPE: COMPLETE PEPTIDE 
(vi) ORIGINAL SOURCE: SYNTHETIC 
(Vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 



Gin Lys Leu Cys Glu Arg Pro Ser 
1 5 

Asn Asn Asn Ala Cys Lys Asn Gin 
20 

His Gly Ser Cys Asn Tyr Val Phe 
35 40 



Gly Thr Trp Ser Gly Val Cys Gly 
10 15 

Cys lie Asn Leu Glu Lys Ala Arg 
25 30 

Pro Ala His Lys 



(89) INFORMATION FOR SEQ ID NO : 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) - MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
(ill) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 

Gin Arg Val Cys Asp Lys Pro Ser Gly Thr Trp Ser Gly Leu Cys Gly 
1 5 10 15 

Asn Asn Asn Ala Cys Arg Gin Asn Cys lie Gin Val Asp Arg Ala Lys 
20 25 30 

Lys Gly Ser Cys Gin Phe Leu Tyr Pro Ala Lys Lys 
35 40 
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(90) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE : AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
( i i i ) HYPOTHETICAL : NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

<X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 

Gin Lys Leu Cys Gin Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly 
1 5 10 15 * 

Asn Asn Asn Ala Cys Lys Asn Gin Cys He Arg Leu Glu Lys Ala Arg 
20 25 30 

His Gly Ser Cys 
35 

(91) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(V) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 

Gin Arg Val Cys Asn Lys Pro Ser Gly Thr Trp Ser Gly Leu Cys Gly 
1 5 10 15 

Asn Asn Asn Ala Cys Arg Gin Asn Cys He Lys Val Asp Arg Ala Lys 
20 25 30 

Lys Gly Ser Cys 
35 



94 



SUBSTITUTE SHEET (RULE 26) 



WO 96/03519 



PCT/US95/09339 



(92) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 

Met Leu Glu Glu Leu Phe Glu Glu Met Thr Glu Phe He Glu Glu Val 
15 10 15 

He Glu Thr Met 
20 

(93) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1228 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE STRANDED 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: GENOMIC DNA AND OTHER NUCLEIC ACID 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


92 




CCAAAGCACA 


TACTTATCGA 


TTTAAATTTC 


ATCGAAGAGA 


TTAATATCGA 


50 


ATAATCATAT 


ACATACTTTA 


AATACATAAC 


AAATTTTAAA 


TACATATATC 


100 


TGGTATATAA 


TTAATTTTTT 


AAAGTCATGA 


AGTATGTATC 


AAATACACAT 


150 


ATGGAAAAAA 


TTAACTATTC 


ATAATTTAAA 


AAATAGAAAA 


GATACATCTA 


200 


GTGAAATTAG 


GTGCATGTAT 


CAAATACATT 


AGGAAAAGGG 


CATATATCTT 


250 


GATCTAGATA 


ATTAACGATT 


TTGATTTATG 


TATAATTTCC 


AAATGAAGGT 


300 


TTATATCTAC 


TTCAGAAATA 


ACAATATACT 


TTTATCAGAA 


CATTCAACAA 


350 


AGCAACAACC 


AACTAGAGTG 


AAAAATACAC 


ATTGTTCTCT 


AGACATACAA 


400 


AATTGAGAAA 


AGAATCTCAA 


AATTTAGAGA 


AACAAATCTG 


AATTTCTAGA 


450 


AGAAAAAAAT 


AATTATGCAC 


TTTGCTATTG 


CTCGAAAAAT 


AAATGAAAGA 


500 


AATTAGACTT 


TTTTAAAAGA 


TGTTAGACTA 


GATATACTCA 


AAAGCTATTA 


550 


AAGGAGTAAT 


ATTCTTCTTA 


CATTAAGTAT 


TTTAGTTACA 


GTCCTGTAAT 


600 
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TAAAGACACA 


TTTTAGATTG 


TATCTAAACT 


TAAATGTATC TAGAATACAT 


650 


ATATTTGAAT 


GCATCATATA 


CATGTATCCG 


ACACACCAAT TCTCATAAAA 


700 


AACGTAATAT 


CCTAAACTAA 


TTTATCCTTC 


AAGTCAACTT AAGCCCAATA 


750 


TACATTTTCA 


TCTCTAAAGG 


CCCAAGTGGC 


ACAAAATGTC AGGCCCAATT 


800 


ACGAAGAAAA 


GGGCTTGTAA 


AACCCTAATA 


AAGTGGCACT GGCAGAGCTT 


850 


ACACTCTCAT 


TCCATCAACA 


AAGAAACCCT 


AAAAGCCGCA GCGCCACTGA 


900 


TTTCTCTCCT 


CCAGGCGAAG 


ATG CAG ATC TTC GTG AAG ACC TTA 
Met Gin lie Phe Val Lys Thr Leu 


944 



ACG GGG AAG ACG ATC ACC CTA GAG GTT GAG TCT TCC GAC ACC 986 
Thr Gly Lys Thr lie Thr Leu Glu Val Glu Ser Ser Asp Thr 
10 15 20 

ATC GAC AAT GTC AAA GCC AAG ATC CAG GAC AAG GAA GGG ATT 1028 
lie Asp ash Val Lys Ala Lys lie Gin Asp Lys Glu Gly lie 
25 30 35 

CCC CCA GAC CAG CAG CGT TTG ATT TTC GCC GGA AAG CAG CTT 1070 
Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys Gin Leu 
40 45 50 

GAG GAT GGT CGT ACT CTT GCC GAC TAC AAC ATC CAG AAG GAG 1112 
Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn He Gin Lys Glu 
55 60 

TCA ACT CTC CAT CTC GTG CTC CGT CTC CGT GGT GGT 1148 

Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65 70 75 

GGA TCC GCT GTT AAA AGA GTG GGT CGT AGG TTG AAA AAG TTG 1190 

Gly Ser Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
80 85 90 

GAC CGT AAG ATT GAT AGG TTA GGA GTT GAT TTT TGATC 1228 
Asp Arg Lys He Asp Arg Leu Gly Val Asp Phe 
95 100 



(94) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1154 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE STRANDED 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: GENOMIC DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 
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CCAAAGCACA 


TACTTATCGA 


TTTAAATTTC 


ATCGAAGAGA 


TTAATATCGA 


50 


ATAATCATAT 


ACATACTTTA 


AATACATAAC 


AAATTTTAAA 


TACATATATC 


100 


TGGTATATAA 


TTAATTTTTT 


AAAGTCATGA 


AGTATGTATC 


AAATACACAT 


150 


ATGGAAAAAA 


TTAACTATTC 


ATAATTTAAA 


AAATAGAAAA 


GATACATCTA 


200 


GTGAAATTAG 


GTGCATGTAT 


CAAATACATT 


AGGAAAAGGG 


CATATATCTT 


250 


GATCTAGATA 


ATTAACGATT 


TTGATTTATG 


TATAATTTCC 


AAATGAAGGT 


300 


TTATATCTAC 


TTCAGAAATA 


ACAATATACT 


TTTATCAGAA 


CATTCAACAA 


350 


AGCAACAACC 


AACTAGAGTG 


AAAAATACAC 


ATTGTTCTCT 


AGACATACAA 


400 


AATTGAGAAA 


AGAATCTCAA 


AATTTAGAGA 


AACAAATCTG 


AATTTCTAGA 


450 


AGAAAAAAAT 


AATTATGCAC 


TTTGCTATTG 


CTCGAAAAAT 


AAATGAAAGA 


500 


AATTAGACTT 


TTTTAAAAGA 


TGTTAGACTA 


GATATACTCA 


AAAGCTATTA 


550 


AAGGAGTAAT 


ATTCTTCTTA 


CATTAAGTAT 


TTTAGTTACA 


GTCCTGTAAT 


600 


TAAAGACACA 


TTTTAGATTG 


TATCTAAACT 


TAAATGTATC 


TAGAATACAT 


650 


m\ a n X x ± VJAn X 




CATGTATCCG 


ACACACCAAT 


TCTCATAAAA 


700 


AACGTAATAT 


CCTAAACTAA 


TTTATCCTTC 


AAGTCAACTT 


AAGCCCAATA 


750 


TACATTTTCA 


TCTCTAAAGG 


CCCAAGTGGC 


ACAAAATGTC 


AGGCCCAATT 


SOO 


ACGAAGAAAA 


GGGCTTGTAA 


AACCCTAATA 


AAGTGGCACT 


GGCAGAGCTT 


850 


ACACTCTCAT 


TCCATCAACA 


AAGAAACCCT 


AAAAGCCGCA 


GCGCCACTGA 


900 


TTTCTCTCCT 


CCAGGCGAAG 


ATG CAG ATC TTC GTG AAG ACC TTA 
Met Gin He Phe Val Lys Thr Leu 


944 



ACG GGG AAG ACG ATC ACC CTA GAG GTT GAG TCT TCC GAC ACC 986 
Thr Gly Lys Thr He Thr Leu Glu Val Glu Ser Ser Asp Thr 
10 * 15 20 

ATC GAC AAT GTC AAA GCC AAG ATC CAG GAC AAG GAA GGG ATT 1028 
He Asp Asn Val Lys Ala Lys He Gin Asp Lys Glu Gly He 
25 30 35 

CCC CCA GAC CAG CAG CGT TTG ATT TTC GCC GGA AAG CAG CTT 1070 
Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys Gin Leu 
40 45 50 

GAG GAT GGT CGT ACT CTT GCC GAC TAC AAC ATC CAG AAG GAG 1112 
Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn He Gin Lys Glu 
55 60 



97 



SUBSTITUTE SHEET (RULE 26) 



WO 96/03519 



PCT/US95/09339 



TCA ACT CTC CAT CTC GTG CTC CGT CTC CGT GGT GGT GGA TCC 1154 
Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gly Ser 
65 70 75 

(95) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 

Mec Gin lie Phe Val Lys Thr Leu 
1 5 

Thr Gly Lys Thr He Thr Leu Glu Val Glu Ser Ser Asp Thr 
10 15 20 

He Asp Asn Val Lys Ala Lys He Gin Asp Lys Glu Gly He 
25 30 35 

Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys Gin Leu 
40 * 45 50 

Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn He Gin Lys Glu 
55 60 

Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gly Ser 
65 70 75 

(96) INFORMATION FOR SEQ ID NO: 95: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2127 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE STRANDED 

(D) TOPOLOGY: LINEAR 
(ii) MOLECULE TYPE: 

(A) DESCRIPTION: GENOMIC DNA AND OTHER DNA 
(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 95 

TTTATCAATC AGATTTGAAC ATATAAATAA ATATAAATTG TCTCAATAAT 50 

TCTACATTAA ACTAATATTT GAAATCTCAA TTTTATGATT TTTTAAATTC 100 

ACTTTATATC CAAGACAATT TNCANCTTCA AAAAGTTTTA TTAAANATTT 150 

ACATTAGTTT TGTTGATGAG GATGACAAGA TNTTGGTCAT CAATTACATA 200 

TACCCAAATT GAATAGTAAG CAACTTCAAT GTTTTTCATA ATGATAATGA 250 

CAGACACAAN NNAAACCCAT TTATTATTCA CATTGATTGA GTTTTATATG 3 00 
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CAATATAGTA ATAATAATAA TATTTCTTAT 
TTTTAATTAT ACCACGTCAC TAAATTATAT 
AAATTTTACT TAAATATCAT GAAATAAACT 
AAATTTTTCC AATAAAAAAA AGTCATTAAG 
GGTAAANGAG TGAAGTCGAC TGACTTTTTT 
AAATTATTAA CTTTAACCTA ATAAAACACT 
AATACTTACC TCTTAGAAAT AAGAAAAAGT 
TTACATTAAA TATTTTCAAT CAAATTTAAA 
TCAATAACAA TATCAAAATA ATATGAAAAA 
GGACGATTTA AGTGCGATTA TCAAGGTAGT 
TATTTGNGCT CTTATATTTA AGGTCATGTT 
CTATATTAGA GCATATATTA AAATAAAAAA 
TTATTTTTAG TATATATTTT TTTACATGAC 
CTAAAGGAGC GTGTAAGTGT CGACCTCATT 
ATAAAAATTA AAAAGGAAAG GTAGCTTTTG 
CACCTCATTA TTACACGTGT CCT CATATAA 
GTTTCGTCTA GAGTCGGCCA TGCCATCTAT 
TCATTTTTTT CATCTTCTAT CTGATTTCTA 
CTTCAAATTT CTCTTTAAGG TTAGAATCTT 
GGTTTTTGTA TGTTTAGATT CTCGAATTAG 
CCCTTCCTTT TGAGTCTCTC CTCGGTTGTC 
TTTGAGTTTT TTTACGTCTG GTTTGATGGA 
TTTCCCCGTT CGTTTTGATG AAAAAGCCCC 
GTCGTTCGTT CTAAAGGTTT AAAATTAGAG 
AAAAGGCCTT AAATTTGAGT TTTTCCGGTT 
GAATTTGTGT TTTTCCGTCG GTTTGATTCT 
TCTCCGGCTG TTTTGATGAA AAAGCCCTAA 
TTTGATGAAA AAGCCCTAAA TTTGAAGTTT 



AAAGCAAGAG 


GTCAATTTTT 


350 


TTGATAATGT 


AAAACAATTC 


400 


ATTTTTATAA 


CCAAATTACT 


450 


AAGACATAAA 


ATAAATTTGA 


500 


TTTTTTTATC 


ATAAGAAAAT 


550 


AATATAATTT 


CATGGAATCT 


600 


GTTTCTAATA 


GACCCTCAAT 


650 


TAACAAATAT 


CAATATGAGG 


700 


AGAGCAATAC 


ATAATATAAG 


750 


ATTATATCCT 


AATTTGCTAA 


600 


CATGATAAAC 


TTGAAATGCG 


850 


ATACCTAAAA 


TAAAATTAAG 


900 


CTACATTTTT 


CTGGGTTTTT 


950 


CTCCTAATTT 


TCCCCACCAC 


1000 


CGTGTTGTTT 


TGGTACACTA 


1050 


TTGGTTAACC 


CTATGAGGCG 


1100 


AAAATGAAGC 


TTTCTGCACC 


1150 


TTATAATTTC 


TCTCAATTGC 


1200 


CTCTATTTTT 




1240 


CTAATCAGGC 


GCTGTTATAG 


1290 


TTGATGGAAA 


AGGCCTAACA 


1340 


AAAGGCCTAC 


AATTGGCCGT 


1390 


TAGTTTGAGA 


TTTTTTTTCT 


1440 


TTTTTACATT 


TGTTTGATGA 


1490 


GATTTGATGA 


AAAAGCCCTA 


1540 


GAAGGCCTAA 


AATTTGAGTT 


1590 


ATTTGAGTTT 


CTCCGGCTGT 


1640 


TTTCCCCGTG 


TTTTAGATTG 


1690 
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TTTAGGTTTT AATTCTCGAA TCAGCTAATC AGGGAGTGTG AAAGCCCTAA 1740 

ATTGAAGTTT TTTTCGTTGT TCTGATTGTT GTTTTTATGA ATTTGCAG 1788 

ATG CAG ATC TTT GTG AAA ACT CTC ACC GGA AAG ACT ATC ACC 1830 
Met Gin He Phe Val Lys Thr Leu Thr Gly Lys Thr He Thr 
15 10 

CTA GAG GTG GAA AGT TCT GAT ACA ATC GAC AAC GTT AAG GCT 1872 
Leu Glu Val Glu Ser Ser Asp Thr He Asp Asn Val Lye Ala 
15 20 25 

AAG ATC CAG GAT AAG GAA GGA ATT CCC CCG GAT CAG CAA AGG 1914 
Lys He Glu Asp Lys Glu Gly He Pro Pro Asp Gin Gin Arg 
30 * 35 40 

CTT ATC TTC GCC GGA AAG CAG TTG GAG GAC GGA CGT ACT CTA 1956 
Leu He Phe Ala Gly Lys Gin Leu Glu Asp Gly Arg Thr Leu 
45 50 55 

GCT GAT TAC AAC ATC CAG AAG GAG TCT ACC CTC CAT TTG GTG 1998 
Ala Asp Tyr Asn He Gin Lys Glu Ser Thr Leu His Leu Val 
60 65 70 

CTC CGT CTA CGT GGA GGT GGA TCC GCT GTT AAA AGA GTG GGT 2040 
Leu Arg Leu Arg Gly Gly Gly Ser Ala Val Lys Arg Val Gly 
75 80 

CGT AGG TTG AAA AAG TTG GAC CGT AAG ATT GAT AGG TTA GGA 20 B 2 
Arg Arg Leu Lys Lys Leu Asp Arg Lys He Asp Arg Leu Gly 
B5 90 95 

GTT GAT TTT TGATCTAGAG TCGACCGATC CCCCGAATTT CCCCGA 2127 
Val Asp Phe 
100 



(97) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2022 

(B) TYPE: NUCLEIC ACID 

(C) STRAND EDNESS : DOUBLE STRANDED 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: GENOMIC DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 

TTTATCAATC AGATTTGAAC ATATAAATAA ATATAAATTG TCTCAATAAT 50 

TCTACATTAA ACTAATATTT GAAATCTCAA TTTTATGATT TTTTAAATTC 100 

ACTTTATATC CAAGACAATT TNCANCTTCA AAAAGTTTTA TTAAANATTT 150 

ACATTAGTTT TGTTGATGAG GATGACAAGA TNTTGGTCAT CAATTACATA 200 
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TACCCAAATT GAATAGTAAG CAACTTCAAT 
CAGACACAAN NMAAACCCAT TTATTATTCA 
CAATATAGTA ATAATAATAA TATTTCTTAT 
TTTTAATTAT ACCACGTCAC TAAATTATAT 
AAATTTTACT TAAATATCAT GAAATAAACT 
AAATTTTTCC AATAAAAAAA AGTCATTAAG 
GGTAAANGAG TGAAGTCGAC TGACTTTTTT 
AAATTATTAA CTTTAACCTA ATAAAACACT 
AATACTTACC TCTTAGAAAT AAGAAAAAGT 
TTACATTAAA TATTTTCAAT CAAATTTAAA 
TCAATAACAA TATCAAAATA ATATGAAAAA 
GGACGATTTA AGTGCGATTA TCAAGGTAGT 
TATTTGNGCT CTTATATTTA AGGTCATGTT 
CTATATTAGA G CATATATT A AAATAAAAAA 
TTATTTTTAG TATATATTTT TTTACATGAC 
CTAAAGGAGC GTGTAAGTGT CGACCTCATT 
ATAAAAATTA AAAAGGAAAG GTAGCTTTTG 
CACCTCATTA TTACACGTGT CCTCATATAA 
GTTTCGTCTA GAGTCGGCCA TGCCATCTAT 
TCATTTTTTT CATCTTCTAT CTGATTTCTA 
CTTCAAATTT CTCTTTAAGG TTAGAATCTT 
GGTTTTTGTA TGTTTAGATT CTCGAATTAG 
CCCTTCCTTT TGAGTCTCTC CTCGGTTGTC 
TTTGAGTTTT TTTACGTCTG GTTTGATGGA 
TTTCCCCGTT CGTTTTGATG AAAAAGCCCC 
GTCGTTCGTT CTAAAGGTTT AAAATTAGAG 
AAAAGGCCTT AAATTTGAGT TTTTCCGGTT 
GAATTTGTGT TTTTCCGTCG GTTTGATTCT 



GTTTTTCATA 


ATGATAATGA 


250 


CATTGATTGA 


GTTTTATATG 


300 


AAAGCAAGAG 


GTCAATTTTT 


350 


TTGATAATGT 


AAAACAATTC 


400 


ATTTTTATAA 


CCAAATTACT 


450 


AAGACATAAA 


ATAAATTTGA 


500 


TTTTTTTATC 


ATAAGAAAAT 


550 


AATATAATTT 


CATGGAATCT 


600 


GTTTCTAATA 


GACCCTCAAT 


650 


TAACAAATAT 


CAATATGAGG 


700 


AGAGCAATAC 


ATAATATAAG 


750 


ATTATATCCT 


AATTTGCTAA 


800 


CATGATAAAC 


TTGAAATGCG 


850 


ATACCTAAAA 


TAAAATTAAG 


900 


. CTACATTTTT 


CTGGGTTTTT 


950 


CTCCTAATTT 


TCCCCACCAC 


1000 


CGTGTTGTTT 


TGGTACACTA 


1050 


TTGGTTAACC 


CTATGAGGCG 


1100 


AAAATGAAGC 


TTTCTGCACC 


1150 


TTATAATTTC 


TCTCAATTGC 


1200 


CTCTATTTTT 




1240 


CTAATCAGGC 


GCTGTTATAG 


1290 


TTGATGGAAA 


AGGCCTAACA 


1340 


AAAGGCCTAC 


AATTGGCCGT 


1390 


TAGTTTGAGA 




1440 


TTTTTACATT 


TGTTTGATGA 


1490 


GATTTGATGA 


AAAAGCCCTA 


1540 


GAAGGCCTAA 


AATTTGAGTT 


1590 
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TCTCCGGCTG TTTTGATGAA AAAGCCCTAA ATTTGAGTTT CTCCGGCTGT 1640 

TTTGATGAAA AAGCCCTAAA TTTGAAGTTT TTTCCCCGTG TTTTAGATTG 1690 

TTTAGGTTTT AATTCTCGAA TCAGCTAATC AGGGAGTGTG AAAGCCCTAA 1740 

ATTGAAGTTT TTTTCGTTGT TCTGATTGTT GTTTTTATGA ATTTGCAG 1788 

ATG CAG ATC TTT GTG AAA ACT CTC ACC GGA AAG ACT ATC ACC 1830 
Met Gin He Phe Val Lys Thr Leu Thr Gly Lys Thr He Thr 
15 10 

CTA GAG GTG GAA AGT TCT GAT ACA ATC GAC AAC GTT AAG GCT 1872 
Leu Glu Val Glu Ser Ser Asp Thr He Asp Asn Val Lys Ala 
15 20 25 

AAG ATC CAG GAT AAG GAA GGA ATT CCC CCG GAT CAG CAA AGG 1914 
Lys He Glu Asp Lys Glu Gly He Pro Pro Asp Gin Gin Arg 
30 " 35 40 

CTT ATC TTC GCC GGA AAG CAG TTG GAG GAC GGA CGT ACT CTA 1956 
Leu He Phe Ala Gly Lys Gin Leu Glu Asp Gly Arg Thr Leu 
45 50 55 

GCT GAT TAC AAC ATC CAG AAG GAG TCT ACC CTC CAT TTG GTG 1998 
Ala Asp Tyr Asn He Gin Lys Glu Ser Thr Leu His Leu Val 
60 65 70 

CTC CGT CTA CGT GGA GGT GGA TCC 2022 
Leu Arg Leu Arg Gly Gly Gly Ser 
75 
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(98) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC „„„„ 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 

Ala Arg Lys He Ala Arg Leu Gly Val Ala Lys Leu Ala Gly Leu Arg 
20 25 30 



Ala Val Leu Lys Phe 
35 



(99) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE : 

' (A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 

Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Asp Arg Lys He 
1 5 10 15 



Asp Arg Leu Gly Val Asp Phe 
20 
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Claims 

What is claimed is: 

1. A polypeptide comprising a ubiquitin- lytic 
peptide fusion protein wherein ubiquitin is linked at 
its 3' terminus to a lytic peptide. 

2. The polypeptide of claim 1 wherein the 
ubiquitin polypeptide is described by sequence SEQ ID 
NO. : 94 . 

3 . The polypeptide of claim 1 wherein the lytic 
peptide is selected from the group consisting of 
cecropins, defensins, sarcotoxins, melittin and 
magainins . 

4 . The polypeptide of claim 1 wherein the lytic 
peptide is a synthetic analog of a naturally occurring 
lytic peptide. 

20 5 . The polypeptide of claim I wherein the lytic 

peptide is selected from SEQ ID Nos . : 1-91 and 97-9B. 

6. A recombinant DNA .molecule comprising a 
nucleotide sequence encoding a ubiquitin- lytic peptide 

25 fusion protein. 

7. The recombinant DNA molecule of claim 6 
wherein the sequence encoding the 5 ' fusion ubiquitin 
peptide comprises nucleotide resides 921 to 1154 of SEQ 

30 ID NO. : 93 . 



8. The recombinant DNA molecule of claim 4 
wherein the sequence encoding the 2' fusion lytic 
peptide is selected from nucleotides sequences coding 
35 for the peptides of SEQ ID Nos : 1-91 and 97-98. 
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